#etl-pipelines

Business intelligence
fromDevOps.com
2 hours ago

Dynatrace to Acquire Bindplane to Process and Route Telemetry Data

Dynatrace is acquiring Bindplane to enhance log management and allow pre-processing of telemetry data for cost reduction and workflow efficiency.
#ai
DevOps
fromDevOps.com
5 hours ago

CloudBees Delivers on AI Promise to Improve Application Testing

CloudBees Smart Tests uses AI to prioritize tests, reducing CI/CD processing time significantly.
Data science
fromTheregister
2 weeks ago

Datadog bets DIY AI will mean it dodges the SaaSpocalypse

Datadog is releasing an AI model to enhance its observability tools and mitigate risks from customers building their own solutions.
Web frameworks
fromInfoQ
20 hours ago

Tiger Teams, Evals and Agents: The New AI Engineering Playbook

Sam Bhagwat is a co-founder and CEO of Mastra, an open source JavaScript/TypeScript framework for building AI agents.
Data science
fromFast Company
3 days ago

Data, not infrastructure, must drive your AI strategy

Data centricity is essential for effective AI strategies, enabling collaboration and problem-solving across business units by making data accessible.
Artificial intelligence
fromComputerworld
2 days ago

AI often doesn't deliver ROI for IT departments either

Only 28% of AI projects in infrastructure and operations achieve meaningful ROI, with many failing due to unrealistic expectations and skills gaps.
Podcast
fromFast Company
4 days ago

3 AI tools that make keeping up with the news easier

Huxe is a personalized audio app that generates custom podcasts based on user interests, calendar, and email.
Online learning
fromeLearning Industry
1 week ago

From Manual To Intelligent: How AI Automation Is Reshaping L&D Operations

AI automation can alleviate operational burdens on L&D teams, allowing them to focus on strategic tasks and improve learning quality.
Business intelligence
fromZDNET
1 day ago

I asked 5 data leaders about how they use AI to automate - and end integration nightmares

Strong processes and AI integration are essential for businesses to effectively utilize data.
#snowflake
Django
fromMedium
1 week ago

Snowflake Supports Directory Imports

Easier package imports into Snowflake functions and procedures from stage directories and SnowGit directories streamline development and deployment.
Artificial intelligence
fromTheregister
2 weeks ago

Snowflake's ongoing pitch: bring AI to data, not vice versa

Snowflake is enhancing its platform for AI integration through strategic partnerships and acquisitions, focusing on customer ROI and data management efficiency.
DevOps
fromTechzine Global
11 hours ago

AWS launches Agent Registry for managing AI agents

AWS introduces the Agent Registry to centralize AI agent management and reduce chaos in organizations deploying numerous agents.
fromInfoWorld
1 week ago

How Apache Kafka flexed to support queues

Apache Kafka has cemented itself as the de facto platform for event streaming, often referred to as the 'universal data substrate' due to its extensive ecosystem that enables connectivity and processing capabilities.
Scala
Software development
fromTechzine Global
1 week ago

The ERP that doesn't care which AI you use, and why that's smart

NetSuite announced three new AI Connector Service extensions, emphasizing a strategic shift towards openness and integration with external AI models.
fromTechzine Global
13 hours ago

Cisco strengthens AI observability Splunk by acquiring Galileo

Galileo provides AI teams with tools to evaluate the quality of AI outputs, detect errors before they reach end users, and continuously improve the behavior of AI agents in production.
DevOps
Scala
fromMedium
1 week ago

Data Extraction and Classification Using Structural Pattern Matching in Scala

Scala pattern matching enhances code readability and extensibility in real-world data engineering use cases.
DevOps
fromInfoQ
1 day ago

Uber's Hive Federation Decentralizes 16K Datasets and 10+ PB for Zero-Downtime Analytics at Scale

Uber redesigned its Hive data warehouse to decentralize datasets, enhancing scalability, security, and operational autonomy for teams.
#artificial-intelligence
Marketing tech
fromBusiness Matters
3 weeks ago

How Artificial Intelligence Is Changing Business Service Delivery

Artificial intelligence transforms business services delivery by automating workflows, improving efficiency, and enhancing service quality in link-building, digital PR, and SEO operations.
fromInfoWorld
2 weeks ago

Databricks pitches Lakewatch as a cheaper SIEM - but is it really?

"This is more likely to complement existing SIEMs than replace them. Early adoption will come from large enterprises already committed to Databricks, especially those seeking flexibility or cost control."
Information security
#agentic-ai
Data science
fromTechzine Global
2 weeks ago

SAP and ODI are working on the IDEA AI-ready data infrastructure

The IDEA program by ODI and SAP aims to make organizational data infrastructure AI-ready through governance, research, and community collaboration.
Information security
fromTechzine Global
2 weeks ago

Databricks launches Lakewatch: agentic SIEM on the Lakehouse

Lakewatch is an open SIEM platform that consolidates security, IT, and business data, enabling rapid threat detection and response using AI agents.
fromInfoWorld
1 day ago

Bringing databases and Kubernetes together

Automating Kubernetes workloads with Operators can provide the same level of functionality as DBaaS, while still avoiding lock-in to a specific provider.
DevOps
fromTechzine Global
2 days ago

AWS S3 buckets now support file systems

S3 Files is built on Amazon EFS and automatically translates file system operations into S3 requests, allowing applications to work with S3 data without code changes.
DevOps
fromInfoWorld
3 weeks ago

Migrating from Apache Airflow v2 to v3

Airflow 3 represents a clear architectural direction for the project: API-driven execution, better isolation, data-aware scheduling and a platform designed for modern scale. While Airflow 2.x is still widely used, it is clearly moving toward long-term maintenance (end-of-life April 2026) with most innovation and architectural investment happening in the 3.x line.
Software development
DevOps
fromInfoWorld
2 days ago

AWS turns its S3 storage service into a file system for AI agents

S3 Files simplifies access to Amazon S3, enhancing its role as a primary data layer for AI and modern applications.
DevOps
fromDevOps.com
3 days ago

Apica Extends Scope and Reach of Platform for Managing Telemetry Data

Apica's Ascent platform update enhances telemetry data management for DevOps teams, improving observability and cost control.
Business intelligence
fromTheregister
1 week ago

Microsoft Fabric Database Hub dubbed 'partial' solution

Microsoft's Fabric Database Hub offers a centralized management solution for its database services but lacks support for non-Microsoft databases.
Data science
fromMedium
3 weeks ago

Building Consistent Data Foundations at Scale

Building consistent data foundations through intentional architecture, engineering, and governance is essential to prevent fragmentation, support AI adoption, ensure regulatory compliance, and enable reliable organizational decisions at scale.
DevOps
fromInfoWorld
3 days ago

What enterprise devops teams should learn from SaaS

Enterprise devops teams can enhance resiliency by adopting practices from SaaS providers, focusing on robust testing, monitoring, and seamless upgrades.
Business intelligence
fromTheregister
1 week ago

SAP eyes broader data integration with Reltio acquisition

SAP is acquiring Reltio to enhance data integration and support its AI platform with improved master data management capabilities.
#ai-automation
Artificial intelligence
fromTechzine Global
3 weeks ago

Snowflake's Project SnowWork targets autonomous enterprise AI

Snowflake launches Project SnowWork, an autonomous AI interface that performs enterprise tasks like forecasts and reports without data team involvement, expanding from backend infrastructure to front-office productivity tool.
fromInfoWorld
4 weeks ago
Artificial intelligence

Databricks launches Genie Code to automate data science and engineering tasks

Data science
fromMedium
1 month ago

Migrating to the Lakehouse Without the Big Bang: An Incremental Approach

Query federation enables safe, incremental lakehouse migration by allowing simultaneous queries across legacy warehouses and new lakehouse systems without risky big bang cutover approaches.
Software development
fromMedium
1 month ago

Unified Databricks Repository for Scala and Python Data Pipelines

Databricks repositories require structured setup with Gradle for multi-language support, dependency management, and version control to scale beyond manual notebook maintenance.
#observability
DevOps
fromTechzine Global
1 week ago

Observability warehouses, the next structural evolution for telemetry

Observability is essential for real-time insights in cloud systems, helping to reduce downtime and improve performance.
DevOps
fromInfoQ
2 weeks ago

Uber Launches IngestionNext: Streaming-First Data Lake Cuts Latency and Compute by 25%

Uber's IngestionNext platform shifts to a streaming-first system, reducing data ingestion latency from hours to minutes for analytics and machine learning.
Business intelligence
fromInfoWorld
3 weeks ago

Snowflake's new 'autonomous' AI layer aims to do the work, not just answer questions

Project SnowWork is Snowflake's autonomous AI layer that automates data analysis tasks like forecasting, churn analysis, and report generation without requiring data team intervention.
Data science
fromMedium
1 month ago

100 Scala Interview Questions and Answers for Data Engineers

Structured Scala and Apache Spark interview preparation requires understanding distributed systems, performance trade-offs, and pipeline design beyond theoretical knowledge.
Business intelligence
fromTechzine Global
3 weeks ago

Microsoft expands Fabric with Database Hub and SQL migration assistant

Microsoft's Database Hub provides centralized management for multiple database services including Azure SQL, Cosmos DB, PostgreSQL, and MySQL, with migration assistance and intelligent agent recommendations.
Startup companies
fromInfoQ
2 months ago

Etleap Launches Iceberg Pipeline Platform to Simplify Enterprise Adoption of Apache Iceberg

Managed Iceberg pipeline platform unifies ingestion, transformation, orchestration, and table operations inside customers' VPCs, enabling enterprise Iceberg adoption without building custom stacks.
DevOps
fromInfoWorld
3 weeks ago

Update your databases now to avoid data debt

Multiple major open source databases reach end-of-life in 2026, requiring teams to plan upgrades and migrations to avoid security risks and higher costs.
Data science
fromInfoWorld
1 month ago

The revenge of SQL: How a 50-year-old language reinvents itself

SQL has experienced a major comeback driven by SQLite in browsers, improved language tools, and PostgreSQL's jsonb type, making it both traditional and exciting for modern development.
Artificial intelligence
fromInfoWorld
1 month ago

Why AI requires rethinking the storage-compute divide

AI workloads require continuous processing of unstructured multimodal data, causing redundant data movement and transformation that wastes infrastructure costs and data scientist time.
fromTechzine Global
2 months ago

4 steps to create a future-proof data infrastructure

A future-proof IT infrastructure is often positioned as a universal solution that can withstand any change. However, such a solution does not exist. Nevertheless, future-proofing is an important concept for IT leaders navigating continuous technological developments and security risks, all while ensuring that daily business operations continue. The challenge is finding a balance between reactive problem solving and proactive planning, because overlooking a change can cost your organization. So, how do you successfully prepare for the future without that one-size-fits-all solution?
Tech industry
fromTechzine Global
4 weeks ago

DataBahn and Microsoft accelerate SIEM deployment through integration

DataBahn's AI-driven connectors automatically normalize, enrich, and route telemetry from more than 500 sources to Microsoft Sentinel. DataBahn's Cruz AI engine determines which data to send to the analytics tier and which to the Sentinel data lake for long-term storage. Customers report cost savings of up to 60 percent on Sentinel ingestion thanks to this intelligent tiering mechanism.
Business intelligence
fromPyImageSearch
2 months ago

Integrating Streamlit with Snowflake for Live Cloud Data Apps (Part 2)

To learn how to query, visualize, and export live warehouse data from Streamlit, just keep reading. Now that we've prepared our helper modules and configured Snowflake credentials, it's time to bring everything together into one cohesive Streamlit app. The main driver script, lesson3_main.py, acts as the command center - defining layout, navigation, and page logic. It connects Streamlit's interactive UI to the Snowflake data warehouse and orchestrates how users query, explore, visualize, and export results.
Python
Miscellaneous
fromTechzine Global
2 months ago

Klarrio uses open source expertise to build foundational data platforms

Klarrio builds compliant, scalable open-source data platforms and platform-engineering foundations, integrating and securing underlying infrastructure so customers can focus on analytics and data science.
Business intelligence
fromTechzine Global
1 month ago

Dataiku introduces platform for scalable enterprise AI

Dataiku launches Platform for AI Success with three new products designed to move AI initiatives from pilots to measurable business outcomes through unified orchestration across cloud providers.
DevOps
fromDeveloper Tech News
4 weeks ago

BMC: Integrating mainframe systems into modern CI/CD pipelines

Mainframe systems must integrate into modern CI/CD pipelines to accelerate delivery while maintaining reliability, replacing legacy Waterfall approaches that prioritize stability over speed.
Tech industry
fromTheregister
2 months ago

Snowflake plugs PostgreSQL into its AI Data Cloud

Snowflake now offers a native PostgreSQL DBaaS in its AI Data Cloud to run transactional workloads alongside analytics and AI under unified governance.
fromTechzine Global
2 months ago

Sumo Logic launches data pipeline apps for Snowflake and Databricks

Snowflake offers a fully managed data platform, but Sumo Logic users often lack insight into performance, login activity, and operational health. The Sumo Logic Snowflake Logs App analyzes login and access activity to identify anomalies or suspicious behavior. It also optimizes data pipelines with insights into long-running or failing queries. Teams can centralize log data to facilitate correlation across applications, cloud services, and data platforms.
Information security
fromInfoWorld
2 months ago

AI is changing the way we think about databases

Developers have spent the past decade trying to forget databases exist. Not literally, of course. We still store petabytes. But for the average developer, the database became an implementation detail; an essential but staid utility layer we worked hard not to think about. We abstracted it behind object-relational mappers (ORM). We wrapped it in APIs. We stuffed semi-structured objects into columns and told ourselves it was flexible.
Software development
Data science
fromTechzine Global
1 month ago

Ataccama puts agentic data observability into platform core

Ataccama ONE introduces Agentic Data Observability technology to ensure high-quality, reliable data for AI systems while preventing autonomous errors and bias in regulated enterprises.
fromDbmaestro
4 years ago

What is Database Delivery Automation and Why Do You Need It?

Manual database deployment means longer release times. Database specialists have to spend several working days prior to release writing and testing scripts which in itself leads to prolonged deployment cycles and less time for testing. As a result, applications are not released on time and customers are not receiving the latest updates and bug fixes. Manual work inevitably results in errors, which cause problems and bottlenecks.
Software development
Information security
fromSecuritymagazine
2 months ago

Product Spotlight on Analytics

Taelor Sutherland is Associate Editor at Security magazine covering enterprise security, coordinating digital content, and holding a BA in English Literature from Agnes Scott College.
Artificial intelligence
fromInfoWorld
2 months ago

Teradata unveils enterprise AgentStack to push AI agents into production

Teradata positions Enterprise AgentStack as a vendor-agnostic execution layer across hybrid environments, contrasting platform-tied AI approaches from Snowflake and Databricks.
fromMedium
2 months ago

How I Fixed a Critical Spark Production Performance Issue (and Cut Runtime by 70%)

"The job didn't fail. It just... never finished." That was the worst part. No errors. No stack traces. Just a Spark job running forever in production - blocking downstream pipelines, delaying reports, and waking up on-call engineers at 2 AM. This is the story of how I diagnosed a real Spark performance issue in production and fixed it drastically, not by adding more machines - but by understanding Spark properly.
Data science
fromMedium
3 months ago

The Complete Guide to Optimizing Apache Spark Jobs: From Basics to Production-Ready Performance

Optimize Spark jobs by using lazy evaluation awareness, early filter and column pruning, partition pruning, and appropriate join strategies to minimize shuffles and I/O.
Software development
fromInfoWorld
2 months ago

Why your next microservices should be streaming SQL-driven

Streaming SQL with UDFs, materialized results, and ML/AI integrations enables continuous, stateful processing of event streams for microservices.
Artificial intelligence
fromMedium
2 months ago

Extracting AI-Ready Data From Organizational Documents

Poor document extraction corrupts retrieval; preserving document structure at ingestion produces reliable embeddings and trustworthy RAG outputs.
Data science
fromInfoQ
1 month ago

Databricks Introduces Lakebase, a PostgreSQL Database for AI Workloads

Databricks Lakebase is a serverless PostgreSQL OLTP database that separates compute from storage and unifies transactional and analytical capabilities.
Software development
fromDbmaestro
4 years ago

If You Don't Have Database Delivery Automation, Brace Yourself for These 10 Problems

Manual database processes break DevOps pipelines; only 12% deploy database changes daily, causing configuration drift, frequent errors, slower time-to-market, and reduced productivity.
#enterprise-ai
fromZDNET
2 months ago
Artificial intelligence

Stuck in AI pilot mode? IBM has a solution to help you scale - without ripping everything up

Data science
fromDevOps.com
2 months ago

Why Data Contracts Need Apache Kafka and Apache Flink

Data contracts formalize schemas, types, and quality constraints through early producer-consumer collaboration to prevent pipeline failures and reduce operational downtime.
fromTechzine Global
1 month ago

Databricks shows how AI strengthens the SaaS model

The rise of generative AI is often seen as an existential threat to the SaaS model. Interfaces would disappear, software would fade away, and existing players would become irrelevant. However, new figures from Databricks paint a different picture. Rather than undermining SaaS, AI appears to be increasing its use. This week, Databricks reported a revenue run rate of $5.4 billion, a 65 percent year-on-year increase. More than a quarter of that now comes from AI-related products.
Artificial intelligence
Data science
fromInfoQ
2 months ago

Beyond the Warehouse: Why BigQuery Alone Won't Solve Your Data Problems

Data warehouses like BigQuery perform well initially but become slow, costly, and disorganized at scale, undermining low-latency operational use and innovation.
Data science
fromInfoWorld
2 months ago

Snowflake debuts Cortex Code, an AI agent that understands enterprise data context

Cortex Code enables developers to use natural language to build, optimize, and deploy governed, production-ready data pipelines, analytics, ML workloads, and AI agents.
Business intelligence
fromTechzine Global
2 months ago

ClickHouse, the open-source challenger to Snowflake and Databricks

ClickHouse is a high-performance columnar OLAP database rapidly adopted by AI and enterprise users, now valued at $15B and acquiring Langfuse.
fromTechzine Global
2 months ago

Alteryx and Google Cloud bring analytics closer to BigQuery

With the introduction of Live Query for BigQuery and Alteryx One: Google Edition, users no longer need to move data to run workflows. Companies that standardize cloud platforms for analytics and AI often see a gap between where data is stored and how it is prepared and used. Alteryx wants to change that by bringing analytics workflows directly to BigQuery. The promise: from data to insight to action, without compromising on security or scalability.
Data science
Artificial intelligence
fromTechzine Global
2 months ago

Snowflake launches Cortex Code agent for understanding data context

Cortex Code is an AI agent that converts complex data engineering, ML, and analytics tasks into natural-language workflows integrated into Snowflake and developer tools.
fromDbmaestro
5 years ago

Database Delivery Automation in the Multi-Cloud World

The main advantage of going the Multi-Cloud way is that organizations can "put their eggs in different baskets" and be more versatile in their approach to how they do things. For example, they can mix it up and opt for a cloud-based Platform-as-a-Service (PaaS) solution when it comes to the database, while going the Software-as-a-Service (SaaS) route for their application endeavors.
DevOps
fromTheregister
2 months ago

Over half of enterprise AI stalls on infrastructure mess

"If you look at the enterprise, there's just enormous enthusiasm to deploy AI, but the problem is that the infrastructure, the power, and the operational foundation that is required to run it just aren't there," Alex Bouzari, CEO of DDN, told The Register. "And so as a result, it pops up in the financial elements with IT projects getting delayed, the GPUs being underutilized, power costs going up. And so the economics, I think, for lots of organizations don't pencil out because of these challenges."
Artificial intelligence
Data science
fromCIO
2 months ago

5 perspectives on modern data analytics

Data/business analytics is the top IT investment priority, yet analytics projects often fail due to poor data, vague objectives, and one-size-fits-all solutions.
Artificial intelligence
fromTechRepublic
6 months ago

New AI Data 'Universal Translator' From Salesforce, Snowflake, Others

Snowflake and other firms created the Open Semantic Interchange to standardize semantics and enable interoperable data sharing among AI-enabled products, reducing semantic mismatches.