#file-reading

[ follow ]
Data science
fromInfoQ
7 hours ago

Redesigning Banking PDF Table Extraction: A Layered Approach with Java

PDF table extraction in enterprise systems is an architectural problem requiring hybrid parsing and machine learning for effective handling.
fromMouse Vs Python
4 days ago

Textual - An Intro to DOM Queries (Part I) - Mouse Vs Python

The query_one() method throughout the Textual documentation allows users to retrieve a single widget that matches a CSS selector or a widget type. You can pass in up to two parameters to query_one(), which are the CSS selector and the widget type, or both at the same time.
jQuery
Node JS
fromRaymondcamden
4 days ago

Summarizing Docs with Built-in AI

On-device summarization of various document types, including Office formats, is achievable using libraries like officeParser and Chrome's Summary API.
Django
fromInfoWorld
4 days ago

Exciting Python features are on the way

Python 3.15 introduces lazy imports, an immutable frozendict, JIT compiler improvements, and enhanced support for WebAssembly.
Productivity
fromMedium
5 days ago

How to prevent "You've hit your limit" when working with Claude Code

Reaching usage limits in Claude Code can hinder productivity, but there are practical tips to manage and reduce this risk.
#python
Python
fromdeath and gravity
6 days ago

Learn Python object-oriented programming with Raymond Hettinger

Subclassing in Python is a technique for code reuse, emphasizing operational views over traditional hierarchical models.
Python
fromdeath and gravity
6 days ago

Learn Python object-oriented programming with Raymond Hettinger

Subclassing in Python is a technique for code reuse, emphasizing operational views over traditional hierarchical models.
Software development
fromArs Technica
4 days ago

New Codex features include the ability to use your computer in the background

OpenAI's Codex desktop app update introduces background task capabilities, scheduling, an in-app browser, and image generation features for enhanced productivity.
Python
fromRealpython
1 week ago

Python Coding With AI (Learning Path) - Real Python

LLM-powered coding tools enhance Python development by assisting in writing, reviewing, and debugging code.
Podcast
fromFast Company
2 weeks ago

3 AI tools that make keeping up with the news easier

Huxe is a personalized audio app that generates custom podcasts based on user interests, calendar, and email.
Software development
fromZDNET
4 days ago

OpenAI's Codex Desktop can run your computer now - and has its own browser

Codex Desktop evolves from coding to broader productivity workflows while still targeting developers.
#structured-data
Data science
fromAol
2 weeks ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
2 weeks ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Java
fromInfoWorld
3 weeks ago

Basic and advanced Java serialization

Order in custom serialization must match exactly to avoid data corruption or deserialization failure.
#ai
DevOps
fromInfoWorld
4 weeks ago

An architecture for engineering AI context

AI systems must intelligently manage context to ensure accuracy and reliability in real applications.
JavaScript
fromMedium
4 weeks ago

Metaprogramming: teaching the compiler to explain your data.

Contextual information enhances debugging by providing field names, types, and nested structures.
fromRubyflow
4 weeks ago

Can Ruby Read an X-Ray? Building a Medical Image Processor.

The inclusion of Sobel edge detection, Laplacian filter, Gaussian blur, and custom convolution as native C bindings in ruby-libgd proves essential for medical imaging applications.
Django
fromZDNET
3 weeks ago

ChatGPT just got a library for all your files - how it works

ChatGPT now features a dedicated library for storing uploaded and generated files, enhancing accessibility and organization.
Software development
fromInfoQ
2 weeks ago

TigerFS Mounts PostgreSQL Databases as a Filesystem for Developers and AI Agents

TigerFS is an experimental filesystem that integrates PostgreSQL, allowing file operations through a standard filesystem interface.
Python
fromRealpython
4 weeks ago

Understanding CRUD Operations in SQL - Real Python

CRUD operations are essential for creating, reading, updating, and deleting data in applications.
Java
fromInfoWorld
1 month ago

Java future calls for boosts with records, primitives, classes

Java is advancing record creation, primitive boxing, and pattern matching through multiple JEPs to improve code conciseness and type handling capabilities.
Python
fromPycoders
3 weeks ago

PyCoder's Weekly | Issue #727

Jazzband is winding down due to the overwhelming number of AI submissions affecting its cooperative model.
Artificial intelligence
fromZDNET
1 month ago

AI is getting scary good at finding hidden software bugs - even in decades-old code

AI models can effectively identify decades-old bugs in legacy code, but this capability also enables hackers to exploit vulnerabilities in deployed systems.
Python
fromDEV Community
1 month ago

I Analyzed the Readability of 10 Popular Developer Documentation Sites

Readability of developer documentation is crucial; many popular docs score below standard readability levels.
Django
fromRealpython
1 month ago

Introduction to Python SQL Libraries Quiz - Real Python

A 9-question interactive quiz assesses proficiency in Python SQL libraries for database connectivity, query execution, and cross-database scripting with SQLite, MySQL, and PostgreSQL.
fromRealpython
1 month ago

Downloading Files From URLs With Python - Real Python

Python makes it straightforward to download files from a URL with its robust set of libraries. For quick tasks, you can use the built-in urllib module or the requests library to fetch and save files. When working with large files, streaming data in chunks can help save memory and improve performance.
Python
fromMedium
2 months ago

Algorithms Are Just Real Life, Formalized

Which Algorithm Is This? If you step back, this maps almost perfectly to the Top K Frequent Elements problem.We usually solve it for integers in a list. Here, the "elements" are audience profiles age and body-type combinations. First, define what an audience profile looks like: case class Profile(age: Int, height: Int, weight: Int) What we want is a function like this:
Scala
Software development
fromMedium
1 month ago

From Concept to Production: The Complete Guide to XML Prompt Engineering for Software Developers

Treat LLMs as compilers, not chatbots, using Structured XML Prompting to generate professional-grade code through a five-stage AI-assisted development cycle with rigid constraints.
#rag
Deliverability
fromBusiness Matters
1 month ago

9 Ways to Split Up Big Documents into Smaller, Shareable Files

Large documents can be efficiently split into smaller, manageable files using online tools and built-in software features without losing formatting or quality.
Online marketing
fromPractical Ecommerce
1 month ago

AI Bots Don't Need Markdown Pages

Serving Markdown versions of web pages to AI bots resembles cloaking tactics and risks diluting essential signals without proven effectiveness in improving visibility.
Digital life
fromInc
2 months ago

Fed Up With AI Slop? These Platforms Will Let You Dial it Down

Platforms are adding settings to reduce low-quality AI-generated content, but fully eliminating such content from feeds is extremely difficult.
Python
fromRealpython
1 month ago

Working With Files in Python Quiz - Real Python

A 12-question interactive quiz assesses proficiency in Python file and directory operations including opening, iterating, filtering, creating, deleting, copying, and renaming files.
Data science
fromInfoWorld
1 month ago

The revenge of SQL: How a 50-year-old language reinvents itself

SQL has experienced a major comeback driven by SQLite in browsers, improved language tools, and PostgreSQL's jsonb type, making it both traditional and exciting for modern development.
#java
fromInfoQ
2 months ago
Java

Java Explores Carrier Classes to Extend Data-Oriented Programming Beyond Records

fromInfoQ
2 months ago
Java

Java Explores Carrier Classes to Extend Data-Oriented Programming Beyond Records

Python
fromRealpython
1 month ago

Working With APIs in Python: Reading Public Data - Real Python

Python with the Requests library enables efficient consumption of REST, SOAP, and GraphQL APIs through HTTP communication, supporting various authentication methods like API keys and OAuth.
Software development
fromInfoWorld
1 month ago

Coding for agents

AI agents reward explicit, consistent, well-documented code over clever or personally-preferred approaches, fundamentally changing software engineering standards toward machine-legibility.
Artificial intelligence
fromInfoQ
2 months ago

Working with Code Assistants: The Skeleton Architecture

Combining Vertical Slice architecture with Dependency Inversion and a Skeleton of base classes constrains AI code assistants' context, producing safer, consistent, and maintainable generated code.
Software development
fromBusiness Matters
1 month ago

AI Document Processing Software for UK SMEs

UK small business owners waste 120 hours annually on document admin; AI processing software eliminates errors, reduces costs, and frees staff for revenue-generating work.
Python
fromRealpython
1 month ago

Python Stacks, Queues, and Priority Queues in Practice Quiz - Real Python

An 11-question quiz tests understanding of Python stacks, queues, priority queues, and their real-world applications in traversal and system architecture.
fromEntrepreneur
1 month ago

This AI Assistant Runs Entirely on Your Computer With No Monthly Fees

It's no secret that businesses are increasingly concerned about artificial intelligence (AI) privacy and escalating subscription costs. Many entrepreneurs find themselves locked into expensive monthly AI services while worrying about where their sensitive business data ends up. Pansophy is an AI desktop assistant that offers a different approach entirely, and a lifetime subscription is available now for only $59.97 (reg. $199).
Artificial intelligence
Software development
fromInfoQ
1 month ago

New Research Reassesses the Value of AGENTS.md Files for AI Coding

AGENTS.md context files often hinder rather than help AI coding agents, with LLM-generated files reducing task success rates by 3% and increasing costs by over 20%.
fromThe Verge
2 months ago

ChatGPT's deep research tool adds a built-in document viewer so you can read its reports

OpenAI is updating ChatGPT's deep research tool with a full-screen viewer that you can use to scroll through and navigate to specific areas of its AI-generated reports. As shown in a video shared by OpenAI, the built-in viewer allows you to open ChatGPT's reports in a window separate from your chat, while showing a table of contents on the left side of the screen, and a list of sources on the right.
Artificial intelligence
Artificial intelligence
fromZDNET
1 month ago

7 AI coding techniques that quietly make you elite

Agentic AI tools make a single developer far more productive, enabling rapid cross-platform product creation by encoding design systems, user profiles, and permanent bug lessons.
Python
fromRealpython
2 months ago

Python's pathlib Module: Taming the File System Quiz - Real Python

Use Python's pathlib module to perform common filesystem operations like path joining, iteration, file creation, writing, renaming, and extracting filename components.
Artificial intelligence
fromInfoQ
2 months ago

OpenAI Begins Article Series on Codex CLI Internals

Codex harness runs an LLM-driven agent loop that assembles prompts, manages context to reduce prompt cache misses, and invokes tools via the Responses API.
Software development
fromTreehouse Blog
2 months ago

Intermediate Python: Files, Packages, and CLI Apps

Intermediate Python enables building file-based, package-enabled, command-line programs that automate tasks, process data, and integrate into real workflows.
Python
fromThepythoncodingstack
2 months ago

Planning Meals, Weekly Shop, Alternative Constructors Using Class Methods

Use alternative constructors via class methods to build Meal and WeeklyMealPlanner objects and protect internal state with read-only properties and shallow copies.
fromInfoWorld
2 months ago

How to use Pandas for data analysis in Python

When it comes to working with data in a tabular form, most people reach for a spreadsheet. That's not a bad choice: Microsoft Excel and similar programs are familiar and loaded with functionality for massaging tables of data. But what if you want more control, precision, and power than Excel alone delivers? In that case, the open source Pandas library for Python might be what you are looking for.
Python
Python
fromPythonmorsels
2 months ago

All iteration is the same in Python

All Python iteration forms use the same iterator protocol; iterating over an object yields its defined iteration items (e.g., dictionary yields keys, string yields characters).
Software development
fromInfoQ
2 months ago

Are You Missing a Data Frame? The Power of Data Frames in Java

DataFrames and data-oriented programming promote modeling immutable data separately from behavior, making Java suitable for DataFrame-style data manipulation comparable to Python.
Software development
fromO'Reilly Media
1 month ago

How to Write a Good Spec for AI Agents

Write smart AI agent specs with clear high-level vision, break tasks into smaller chunks, plan in read-only mode first, then execute and iterate continuously while staying within practical context limits.
fromInfoWorld
2 months ago

AI is changing the way we think about databases

Developers have spent the past decade trying to forget databases exist. Not literally, of course. We still store petabytes. But for the average developer, the database became an implementation detail; an essential but staid utility layer we worked hard not to think about. We abstracted it behind object-relational mappers (ORM). We wrapped it in APIs. We stuffed semi-structured objects into columns and told ourselves it was flexible.
Software development
#brotli
Software development
fromZDNET
2 months ago

I used GPT-5.2-Codex to find a mystery bug and hosting nightmare - it was beyond fast

A $20 ChatGPT Plus plan with Codex can diagnose and fix real-world WordPress plugin bugs and hosting issues, saving developer time.
[ Load more ]