#kuzudb

[ follow ]
fromTheregister
4 days ago

DuckDB uses RDBMS to tackle lakehouse 'small changes' issue

You make a small change to your table, adding a single row, and it affects data lake performance because, due to the way they work, a new file has to be written that contains one row, and then a bunch of metadata has to be written. This is very inefficient, because formats like Parquet really don't want to store a single row, they want to store a million rows.
Data science
Scala
fromInfoQ
4 days ago

Lakehouse Tower of Babel: Handling Identifier Resolution Rules Across Database Engines

Open table formats standardize data semantics but lack SQL dialect interoperability, complicating identifier resolution across different engines.
#postgresql
DevOps
fromInfoQ
1 week ago

Google Cloud Highlights Ongoing Work on PostgreSQL Core Capabilities

Google Cloud has made significant technical contributions to PostgreSQL, enhancing logical replication, upgrade processes, and system stability.
Data science
fromInfoWorld
1 week ago

Google Cloud introduces QueryData to help AI agents create reliable database queries

QueryData enhances AI agents' accuracy in querying databases by translating natural language into precise database queries.
Software development
fromInfoQ
2 weeks ago

TigerFS Mounts PostgreSQL Databases as a Filesystem for Developers and AI Agents

TigerFS is an experimental filesystem that integrates PostgreSQL, allowing file operations through a standard filesystem interface.
fromInfoWorld
1 week ago

Bringing databases and Kubernetes together

Automating Kubernetes workloads with Operators can provide the same level of functionality as DBaaS, while still avoiding lock-in to a specific provider.
DevOps
DevOps
fromInfoQ
1 week ago

Etsy Migrates 1000-Shard, 425 TB MySQL Sharding Architecture to Vitess

Etsy migrated its MySQL sharding infrastructure to Vitess, enhancing data management and enabling resharding capabilities.
Node JS
fromhowtocenterdiv.com
3 weeks ago

Database Performance Bottlenecks: N+1 Queries, Missing Indexes, and Connection Pools

Database issues, like missing indexes and N+1 queries, are often overlooked in software engineering, leading to persistent performance problems.
Django
fromInfoQ
3 weeks ago

Open Source, Community, and Consequence: The Story of MongoDB

The keynote focuses on the document model and the community behind MongoDB, rather than the database itself.
DevOps
fromTheregister
1 week ago

AWS put a file system on S3; I stress-tested it

AWS S3 Files allows mounting S3 buckets as NFS shares, providing solid conflict resolution and cost-effective storage options.
Python
fromRealpython
4 weeks ago

Understanding CRUD Operations in SQL - Real Python

CRUD operations are essential for creating, reading, updating, and deleting data in applications.
fromDjangocentral
3 weeks ago

How to Perform AND Queries in Django ORM

The simplest way to perform an AND query in Django ORM is by chaining multiple filter() methods. Each subsequent filter narrows down the queryset further by adding an additional condition.
DevOps
fromInfoQ
2 weeks ago

Replacing Database Sequences at Scale Without Breaking 100+ Services

Validating requirements can simplify complex problems, and embedding sequence generation reduces network calls, enhancing performance and reliability.
Business intelligence
fromTheregister
1 month ago

Microsoft promises multi database wrangling hub on Fabric

Microsoft launched Database Hub, a unified management tool within Fabric that consolidates multiple database services across on-premises, PaaS, and SaaS environments with AI-assisted capabilities.
Scala
fromInfoQ
1 month ago

QCon London 2026: Introducing Tansu.io -- Rethinking Kafka for Lean Operations

Tansu is an open-source, stateless messaging broker that replaces Kafka's complex architecture with a simpler, durable storage model.
DevOps
fromInfoQ
3 weeks ago

ProxySQL Introduces Multi-Tier Release Strategy With Stable, Innovative, and AI Tracks

ProxySQL 3.0.6 introduces a multi-tier release strategy focusing on stability, innovation, and AI capabilities for diverse user needs.
Business intelligence
fromInfoWorld
1 month ago

Why Postgres has won as the de facto database: Today and for the agentic future

Leading enterprises achieve 5x ROI by adopting open source databases like PostgreSQL to unify structured and unstructured data for agentic AI, with 81% of successful enterprises committed to open source strategies.
fromInfoWorld
1 month ago

How to create AI agents with Neo4j Aura Agent

Neo4j Aura Agent is an end-to-end platform for creating agents, connecting them to knowledge graphs, and deploying to production in minutes. In this post, we'll explore the features of Neo4j Aura Agent that make this all possible, along with links to coded examples to get hands-on with the platform.
Data science
DevOps
fromTechzine Global
3 weeks ago

OpenObserve lowers observability storage costs by 140x

OpenObserve offers an AI-native open source platform that significantly reduces costs and infrastructure needs in the observability market.
Data science
fromMedium
1 month ago

Migrating to the Lakehouse Without the Big Bang: An Incremental Approach

Query federation enables safe, incremental lakehouse migration by allowing simultaneous queries across legacy warehouses and new lakehouse systems without risky big bang cutover approaches.
#mariadb
DevOps
fromInfoQ
1 month ago

AWS Expands Aurora DSQL with Playground, New Tool Integrations, and Driver Connectors

Amazon Aurora DSQL introduces usability enhancements, including a browser-based playground and integrations with popular SQL tools for improved developer experience.
#ai
#mariadb-acquisition
Business intelligence
fromInfoWorld
1 month ago

MariaDB taps GridGain to keep pace with AI-driven data demands

MariaDB's acquisition of GridGain aims to create an integrated platform combining relational database reliability with in-memory computing speed to compete with hyperscaler offerings.
Business intelligence
fromInfoWorld
1 month ago

MariaDB taps GridGain to keep pace with AI-driven data demands

MariaDB's acquisition of GridGain aims to create an integrated platform combining relational database reliability with in-memory computing speed to compete with hyperscaler offerings.
Django
fromRealpython
1 month ago

Introduction to Python SQL Libraries Quiz - Real Python

A 9-question interactive quiz assesses proficiency in Python SQL libraries for database connectivity, query execution, and cross-database scripting with SQLite, MySQL, and PostgreSQL.
#mysql
Artificial intelligence
fromInfoQ
2 months ago

MongoDB Introduces Embedding and Reranking API on Atlas

MongoDB Atlas now offers an Embedding and Reranking API with Voyage AI models, enabling unified semantic search, automated embeddings, and integrated monitoring and billing.
Software development
fromInfoWorld
2 months ago

4 self-contained databases for your apps

XAMPP provides a complete local web stack (MariaDB, Apache, PHP, Mercury SMTP, OpenSSL) while PostgreSQL can be run standalone or embedded via pgserver in Python.
DevOps
fromTheregister
1 month ago

AWS spurs Catch-22, ending PostgreSQL 13 support for RDS

AWS RDS PostgreSQL 13 end of support forces upgrades to PostgreSQL 14+, but this breaks AWS Glue ETL service due to incompatible authentication schemes, creating a production environment conflict.
fromRaymondcamden
2 months ago

I threw thousands of files at Astro and you won't believe what happened next...

I began by creating a soft link locally from my blog's repo of posts to the src/pages/posts of a new Astro site. My blog currently has 6742 posts (all high quality I assure you). Each one looks like so: --- layout: post title: "Creating Reddit Summaries with URL Context and Gemini" date: "2026-02-09T18:00:00" categories: ["development"] tags: ["python","generative ai"] banner_image: /images/banners/cat_on_papers2.jpg permalink: /2026/02/09/creating-reddit-summaries-with-gemini description: Using Gemini APIs to create a summary of a subreddit. --- Interesting content no one will probably read here...
Austin
Web development
fromInfoWorld
1 month ago

The browser is your database: Local-first comes of age

Local-first technologies like PGlite and RxDB embed feature-rich databases directly in browsers, enabling instant interactivity while maintaining server synchronization through background sync engines.
Software development
fromMedium
2 months ago

The Complete Database Scaling Playbook: From 1 to 10,000 Queries Per Second

Database scaling to 10,000 QPS requires staged architectural strategies timed to traffic thresholds to avoid outages or unnecessary cost.
DevOps
fromInfoWorld
1 month ago

Update your databases now to avoid data debt

Multiple major open source databases reach end-of-life in 2026, requiring teams to plan upgrades and migrations to avoid security risks and higher costs.
Tech industry
fromInfoQ
2 months ago

Google Introduces Managed Connection Pooling for AlloyDB

AlloyDB's managed connection pooling increases client connections and transactional throughput while reducing operational burden and latency for high-concurrency and serverless workloads.
fromDbmaestro
5 years ago

5 Pillars of Database Compliance Automation |

There is a growing emphasis on database compliance today due to the stricter enforcement of compliance rules and regulations to safeguard user privacy. For example, GDPR fines can reach £17.5 million or 4% of annual global turnover (the higher of the two applies). Besides the direct monetary implications, companies also need to prioritize compliance to protect their brand reputation and achieve growth.
EU data protection
Python
fromRealpython
2 months ago

TinyDB: A Lightweight JSON Database for Small Projects - Real Python

TinyDB is a lightweight, pure-Python, document-oriented NoSQL database storing JSON documents and offering simple CRUD for small, local, single-process projects.
fromPaolo Melchiorre
1 month ago

Django ORM Standalone: Querying an existing database

For a long time I wanted to document something I have done many times in production systems but never explained clearly: using Django ORM as a standalone module to connect to an existing database. In my work I have often dealt with legacy systems where the only reliable source of truth was the database itself. In those situations, Django ORM became my Swiss army knife.
Web frameworks
#dynamodb
Data science
fromInfoWorld
1 month ago

The revenge of SQL: How a 50-year-old language reinvents itself

SQL has experienced a major comeback driven by SQLite in browsers, improved language tools, and PostgreSQL's jsonb type, making it both traditional and exciting for modern development.
DevOps
fromInfoQ
1 month ago

From Minutes to Seconds: Uber Boosts MySQL Cluster Uptime with Consensus Architecture

Uber redesigned MySQL infrastructure using Group Replication to reduce failover time from minutes to seconds while maintaining strong consistency across thousands of clusters.
Tech industry
fromInfoQ
2 months ago

Expired Oracle Patent Opens Fast Sorting Algorithm to Open Source Databases

An expired Oracle patent enables open-source databases to implement an adaptive "Orasort" that speeds sorting of similar keys by skipping common prefixes and caching substrings.
DevOps
fromInfoQ
1 month ago

Netflix Automates RDS PostgreSQL to Aurora PostgreSQL Migration Across 400 Production Clusters

Netflix automated RDS to Aurora PostgreSQL migrations across 400 production clusters through infrastructure-level orchestration, eliminating manual intervention while maintaining data integrity and CDC pipeline correctness.
fromInfoWorld
2 months ago

What's new in MySQL 9.0

But it still contains useful things and can be upgraded to from MySQL 8.4 LTS; the MySQL Configurator automatically does the upgrade without user intervention during MSI installations on Windows. The major changes include: A new Vector datatype is supported in CREATE and ALTER statements. JavaScript Stored Programs, which support JavaScript-based stored programs and functions, has come to MySQL Enterprise Edition. JavaScript Stored Programs can call SQL, and SQL can call them.
Software development
Data science
fromInfoQ
1 month ago

Databricks Introduces Lakebase, a PostgreSQL Database for AI Workloads

Databricks Lakebase is a serverless PostgreSQL OLTP database that separates compute from storage and unifies transactional and analytical capabilities.
Software development
fromDbmaestro
4 years ago

If You Don't Have Database Delivery Automation, Brace Yourself for These 10 Problems |

Manual database processes break DevOps pipelines; only 12% deploy database changes daily, causing configuration drift, frequent errors, slower time-to-market, and reduced productivity.
Artificial intelligence
fromInfoWorld
2 months ago

Five MCP servers to rule the cloud

Major cloud providers now offer official MCP servers that let AI agents automate cloud operations using existing cloud credentials and natural language commands.
Data science
fromInfoWorld
1 month ago

Buyer's guide: Comparing the leading cloud data platforms

Five leading cloud data platforms—Databricks, Snowflake, Amazon RedShift, Google BigQuery, and Microsoft Fabric—offer distinct architectural approaches for enterprise data storage, analytics, and AI workloads.
DevOps
fromInfoQ
1 month ago

Google BigQuery Previews Cross-Region SQL Queries for Distributed Data

BigQuery's global queries feature enables SQL queries across multiple geographic regions without data movement, eliminating ETL pipelines for distributed analytics.
Software development
fromInfoWorld
2 months ago

Why your next microservices should be streaming SQL-driven

Streaming SQL with UDFs, materialized results, and ML/AI integrations enables continuous, stateful processing of event streams for microservices.
Artificial intelligence
fromInfoWorld
1 month ago

Why AI requires rethinking the storage-compute divide

AI workloads require continuous processing of unstructured multimodal data, causing redundant data movement and transformation that wastes infrastructure costs and data scientist time.
fromInfoWorld
2 months ago

AI is changing the way we think about databases

Developers have spent the past decade trying to forget databases exist. Not literally, of course. We still store petabytes. But for the average developer, the database became an implementation detail; an essential but staid utility layer we worked hard not to think about. We abstracted it behind object-relational mappers (ORM). We wrapped it in APIs. We stuffed semi-structured objects into columns and told ourselves it was flexible.
Software development
fromInfoQ
2 months ago

Firestore Adds Pipeline Operations with Over 100 New Query Features

Google has overhauled Firestore Enterprise edition's query engine, adding Pipeline operations that let developers chain together multiple query stages for complex aggregations, array operations, and regex matching. The update removes Firestore's longstanding query limitations and makes indexes optional, putting the database on par with other major NoSQL platforms. Pipeline operations work through sequential stages that transform data inside the database.
Software development
Software development
fromInfoQ
2 months ago

OpenAI Scales Single Primary Postgresql to Millions of Queries per Second for ChatGPT

OpenAI scaled a single-primary PostgreSQL to millions of queries per second by optimizing instance size, query patterns, read replicas, and offloading write-heavy workloads.
fromDbmaestro
5 years ago

Database Delivery Automation in the Multi-Cloud World

The main advantage of going the Multi-Cloud way is that organizations can "put their eggs in different baskets" and be more versatile in their approach to how they do things. For example, they can mix it up and opt for a cloud-based Platform-as-a-Service (PaaS) solution when it comes to the database, while going the Software-as-a-Service (SaaS) route for their application endeavors.
DevOps
Software development
fromMedium
2 months ago

When Kafka Lag Lies: A Production Debugging Story

Uncommitted Kafka offsets can cause persistent consumer-group lag even when ingestion is low, databases are idle, and no errors are observed.
#clickhouse
fromTechzine Global
2 months ago

Databricks makes serverless Postgress service Lakebase available

Databricks today announced the general availability of Lakebase on AWS, a new database architecture that separates compute and storage. The managed serverless Postgres service is designed to help organizations build faster without worrying about infrastructure management. When databases link compute and storage, every query must use the same CPU and memory resources. This can cause a single heavy query to affect all other operations. By separating compute and storage, resources automatically scale with the actual load.
Software development
Software development
fromInfoQ
2 months ago

[Video Podcast] Building Resilient Event-Driven Microservices in Financial Systems with Muzeeb Mohammad

Event-driven architectures using Kafka enable decoupling backend workflows, improving scalability and SLAs for complex multi-system processes like account opening.
fromInfoWorld
2 months ago

The 'Super Bowl' standard: Architecting distributed systems for massive concurrency

When I manage infrastructure for major events (whether it is the Olympics, a Premier League match or a season finale) I am dealing with a "thundering herd" problem that few systems ever face. Millions of users log in, browse and hit "play" within the same three-minute window. But this challenge isn't unique to media. It is the same nightmare that keeps e-commerce CTOs awake before Black Friday or financial systems architects up during a market crash. The fundamental problem is always the same: How do you survive when demand exceeds capacity by an order of magnitude?
DevOps
fromMedium
3 months ago

How I Fixed a Critical Spark Production Performance Issue (and Cut Runtime by 70%)

"The job didn't fail. It just... never finished." That was the worst part. No errors.No stack traces.Just a Spark job running forever in production - blocking downstream pipelines, delaying reports, and waking up-on-call engineers at 2 AM. This is the story of how I diagnosed a real Spark performance issue in production and fixed it drastically, not by adding more machines - but by understanding Spark properly.
fromInfoQ
2 months ago

VillageSQL Launches as an Extension-Focused MySQL Fork

A new open-source project, VillageSQL, has been introduced as a tracking fork of MySQL aimed at expanding extensibility and addressing feature gaps increasingly relevant to AI and agent-based workloads. Announced by founder Dominic Preuss, VillageSQL Server for MySQL is positioned as a drop-in replacement that maintains compatibility with upstream MySQL while adding a structured extension framework. The alpha release is now available for experimentation.
Software development
[ Load more ]