#data-profiling-tools

[ follow ]
Data science
fromInfoWorld
5 minutes ago

Addressing the challenges of unstructured data governance for AI

Enterprises must enhance data governance for unstructured data as AI transforms data management practices.
DevOps
fromDevOps.com
1 hour ago

Grafana Labs Extends Observability Reach Deeper Into AI - DevOps.com

Grafana Labs has enhanced its observability platform with AI capabilities and introduced new tools for AI application monitoring and data collection.
Marketing tech
fromMarTech
20 hours ago

How to unify and orchestrate your B2B data to drive revenue | MarTech

B2B organizations face revenue loss due to misalignment between marketing and sales, leading to ineffective acquisition strategies.
Scala
fromInfoQ
3 days ago

Lakehouse Tower of Babel: Handling Identifier Resolution Rules Across Database Engines

Open table formats standardize data semantics but lack SQL dialect interoperability, complicating identifier resolution across different engines.
#snowflake
Business
from24/7 Wall St.
4 days ago

Autodesk vs Snowflake: Which Is the Better Short Candidate Right Now?

Snowflake carries more downside risk than Autodesk due to its high valuation and significant losses.
Django
fromMedium
2 weeks ago

Snowflake Supports Directory Imports

Easier package imports into Snowflake functions and procedures from stage directories and SnowGit directories streamline development and deployment.
Artificial intelligence
fromTheregister
4 weeks ago

Snowflake's ongoing pitch: bring AI to data, not vice versa

Snowflake is enhancing its platform for AI integration through strategic partnerships and acquisitions, focusing on customer ROI and data management efficiency.
Business
from24/7 Wall St.
4 days ago

Autodesk vs Snowflake: Which Is the Better Short Candidate Right Now?

Snowflake carries more downside risk than Autodesk due to its high valuation and significant losses.
Django
fromMedium
2 weeks ago

Snowflake Supports Directory Imports

Easier package imports into Snowflake functions and procedures from stage directories and SnowGit directories streamline development and deployment.
Artificial intelligence
fromTheregister
4 weeks ago

Snowflake's ongoing pitch: bring AI to data, not vice versa

Snowflake is enhancing its platform for AI integration through strategic partnerships and acquisitions, focusing on customer ROI and data management efficiency.
Artificial intelligence
fromMarTech
5 days ago

3 AI shifts reshaping market research | MarTech

AI is transforming market research by evolving from a tool for tasks to a collaborative research environment that enhances data-driven insights.
Privacy professionals
fromwww.bbc.com
6 days ago

Palantir defends its record as MPs demand more scrutiny of data use

Palantir defends its NHS data contracts amid scrutiny, emphasizing its role in integrating fragmented healthcare systems and ensuring data security.
fromRealpython
6 days ago

Altair: Declarative Charts With Python Quiz - Real Python

The quiz contains 7 questions and there is no time limit. You'll get 1 point for each correct answer.
Python
Data science
fromMedium
1 day ago

What is a Datathon? And Why You Should Join One

Datathons are collaborative events where participants analyze real-world datasets to generate insights and solve practical problems.
DevOps
fromwww.bankingdive.com
1 day ago

How proactive DEX strengthens IT compliance in financial services

Proactive DEX management helps financial services organizations address compliance challenges by continuously monitoring and improving the digital workplace.
#agentic-ai
Information security
fromTechzine Global
3 weeks ago

Databricks launches Lakewatch: agentic SIEM on the Lakehouse

Lakewatch is an open SIEM platform that consolidates security, IT, and business data, enabling rapid threat detection and response using AI agents.
Information security
fromTechzine Global
3 weeks ago

Databricks launches Lakewatch: agentic SIEM on the Lakehouse

Lakewatch is an open SIEM platform that consolidates security, IT, and business data, enabling rapid threat detection and response using AI agents.
Marketing tech
fromMarTech
3 days ago

Reclaiming the power of the story - fueled by data and AI | MarTech

Marketing teams can enhance storytelling and authenticity by integrating AI without sacrificing human connection.
#generative-ai
Data science
fromMedium
4 days ago

Is the Data Scientist Role Dead? No, it's Transforming

The data scientist role is evolving, not disappearing, as organizations demand broader skills and system-oriented thinking.
DevOps
fromInfoWorld
5 days ago

The agent tier: Rethinking runtime architecture for context-driven enterprise workflows

Digital workflows in large enterprises struggle to adapt to contextual variations, leading to increased complexity and challenges in customer onboarding processes.
Marketing tech
fromFortune
4 days ago

Palantir exec: the biggest mistake retailers are making with AI? Trying to do it all with one agent | Fortune

Retail teams face challenges with AI solutions that oversimplify complex decision-making processes, leading to potential failures in operations.
Data science
fromMarTech
3 days ago

Synthetic research is a promise with a catch | MarTech

Economic pressure for quick research results conflicts with the scientific demand for rigor, leading to potential biases in synthetic data outputs.
#ai
Data science
fromTheregister
3 weeks ago

Datadog bets DIY AI will mean it dodges the SaaSpocalypse

Datadog is releasing an AI model to enhance its observability tools and mitigate risks from customers building their own solutions.
Data science
fromTheregister
3 weeks ago

Datadog bets DIY AI will mean it dodges the SaaSpocalypse

Datadog is releasing an AI model to enhance its observability tools and mitigate risks from customers building their own solutions.
Marketing tech
fromAdExchanger
1 week ago

AI Is Nothing Without Data Fidelity. Here's A Four-Step Approach to Protect It | AdExchanger

Data integrity is crucial for effective AI in advertising, as flawed data leads to poor outcomes.
fromTheregister
4 days ago

DuckDB uses RDBMS to tackle lakehouse 'small changes' issue

You make a small change to your table, adding a single row, and it affects data lake performance because, due to the way they work, a new file has to be written that contains one row, and then a bunch of metadata has to be written. This is very inefficient, because formats like Parquet really don't want to store a single row, they want to store a million rows.
Data science
Business intelligence
fromZDNET
1 week ago

I asked 5 data leaders about how they use AI to automate - and end integration nightmares

Strong processes and AI integration are essential for businesses to effectively utilize data.
Science
fromNature
4 weeks ago

Drowning in data sets? Here's how to cut them down to size

The Square Kilometre Array Observatory will generate massive data, but storage and retention pose significant challenges for researchers.
#observability
DevOps
fromTechzine Global
2 weeks ago

Observability warehouses, the next structural evolution for telemetry

Observability is essential for real-time insights in cloud systems, helping to reduce downtime and improve performance.
DevOps
fromTechzine Global
2 weeks ago

Observability warehouses, the next structural evolution for telemetry

Observability is essential for real-time insights in cloud systems, helping to reduce downtime and improve performance.
fromInfoWorld
3 weeks ago

Databricks pitches Lakewatch as a cheaper SIEM - but is it really?

"This is more likely to complement existing SIEMs than replace them. Early adoption will come from large enterprises already committed to Databricks, especially those seeking flexibility or cost control."
Information security
Data science
fromFast Company
1 week ago

Data, not infrastructure, must drive your AI strategy

Data centricity is essential for effective AI strategies, enabling collaboration and problem-solving across business units by making data accessible.
Marketing tech
fromEMARKETER
3 weeks ago

Brands want personalization at scale, but their data stack keeps getting in the way

Limited platform integration is the top barrier to personalization for 42% of brand marketers and 47% of agency marketers in North America.
Business intelligence
fromInfoWorld
2 weeks ago

Kilo targets shadow AI agents with a managed enterprise platform

KiloClaw for Organizations enhances AI agent management with centralized governance, addressing security and compliance concerns for enterprises.
Data science
fromAol
2 weeks ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Software development
fromMedium
1 month ago

Unified Databricks Repository for Scala and Python Data Pipelines

Databricks repositories require structured setup with Gradle for multi-language support, dependency management, and version control to scale beyond manual notebook maintenance.
DevOps
fromInfoWorld
1 month ago

Update your databases now to avoid data debt

Multiple major open source databases reach end-of-life in 2026, requiring teams to plan upgrades and migrations to avoid security risks and higher costs.
Business intelligence
fromInfoWorld
1 month ago

Snowflake's new 'autonomous' AI layer aims to do the work, not just answer questions

Project SnowWork is Snowflake's autonomous AI layer that automates data analysis tasks like forecasting, churn analysis, and report generation without requiring data team intervention.
Artificial intelligence
fromInfoWorld
1 month ago

Databricks launches Genie Code to automate data science and engineering tasks

Databricks launched Genie Code, an AI agent that automates data science and engineering tasks within its lakehouse platform to accelerate ML workflows and enterprise data operations.
fromThe Drum
2 months ago

Deeper data delivers more inspired partnership decisions

Imagine you're selecting an influencer to work with on your new campaign. You've narrowed it down to two, both in the right area, both creating the right sort of content. One has 24.6 million subscribers, the other 1.4 million. Which do you choose? Now imagine you could find out the first had 8.7 million unique viewers last month, while the second had 9.9 million. Do you want to change your mind?
Marketing
Tech industry
fromComputerworld
2 months ago

New Tableau AI features and Slack integration aim for data accessibility

Tableau added AI-powered personalization, automation, natural-language data stories, data mapping, and Slack integration to make data more accessible and actionable for business users.
Miscellaneous
fromTechzine Global
2 months ago

Klarrio uses open source expertise to build foundational data platforms

Klarrio builds compliant, scalable open-source data platforms and platform-engineering foundations, integrating and securing underlying infrastructure so customers can focus on analytics and data science.
Data science
fromMedium
1 month ago

Building Consistent Data Foundations at Scale

Building consistent data foundations through intentional architecture, engineering, and governance is essential to prevent fragmentation, support AI adoption, ensure regulatory compliance, and enable reliable organizational decisions at scale.
fromDbmaestro
5 years ago

5 Pillars of Database Compliance Automation |

There is a growing emphasis on database compliance today due to the stricter enforcement of compliance rules and regulations to safeguard user privacy. For example, GDPR fines can reach £17.5 million or 4% of annual global turnover (the higher of the two applies). Besides the direct monetary implications, companies also need to prioritize compliance to protect their brand reputation and achieve growth.
EU data protection
Startup companies
fromInfoQ
2 months ago

Etleap Launches Iceberg Pipeline Platform to Simplify Enterprise Adoption of Apache Iceberg

Managed Iceberg pipeline platform unifies ingestion, transformation, orchestration, and table operations inside customers' VPCs, enabling enterprise Iceberg adoption without building custom stacks.
fromInfoWorld
2 months ago

AI is changing the way we think about databases

Developers have spent the past decade trying to forget databases exist. Not literally, of course. We still store petabytes. But for the average developer, the database became an implementation detail; an essential but staid utility layer we worked hard not to think about. We abstracted it behind object-relational mappers (ORM). We wrapped it in APIs. We stuffed semi-structured objects into columns and told ourselves it was flexible.
Software development
Django
fromRealpython
1 month ago

Automate Python Data Analysis With YData Profiling Quiz - Real Python

An interactive 8-question quiz assesses proficiency in YData Profiling for automating Python data analysis tasks including report generation, dataset comparison, and time series preparation.
Tech industry
fromTheregister
2 months ago

Snowflake plugs PostgreSQL into its AI Data Cloud

Snowflake now offers a native PostgreSQL DBaaS in its AI Data Cloud to run transactional workloads alongside analytics and AI under unified governance.
Data science
fromMedium
1 month ago

Migrating to the Lakehouse Without the Big Bang: An Incremental Approach

Query federation enables safe, incremental lakehouse migration by allowing simultaneous queries across legacy warehouses and new lakehouse systems without risky big bang cutover approaches.
Python
fromRealpython
1 month ago

Automate Python Data Analysis With YData Profiling - Real Python

YData Profiling generates interactive exploratory data analysis reports with summary statistics, visualizations, and data quality warnings from pandas DataFrames in just a few lines of code.
Business intelligence
fromEntrepreneur
1 month ago

The Game-Changing Tech Saving Companies From Data Disasters

Combining Continuous Data Protection with AI capabilities enables businesses to achieve near-zero Recovery Point Objectives and minimal Recovery Time Objectives, preventing data loss and minimizing downtime.
Python
fromTreehouse Blog
1 month ago

Python for Data: A SQL + Pandas Mini-Project That Actually Prepares You for Real Work

Effective data analysis requires combining SQL and Python skills in integrated projects that mirror real-world workflows, not learning them in isolation.
Information security
fromSecuritymagazine
2 months ago

Product Spotlight on Analytics

Taelor Sutherland is Associate Editor at Security magazine covering enterprise security, coordinating digital content, and holding a BA in English Literature from Agnes Scott College.
Business intelligence
fromTechzine Global
1 month ago

Dataiku introduces platform for scalable enterprise AI

Dataiku launches Platform for AI Success with three new products designed to move AI initiatives from pilots to measurable business outcomes through unified orchestration across cloud providers.
fromInfoWorld
2 months ago

AI-augmented data quality engineering

SHAP for feature attribution SHAP quantifies each feature's contribution to a model prediction, enabling: LIME for local interpretability LIME builds simple local models around a prediction to show how small changes influence outcomes. It answers questions like: "Would correcting age change the anomaly score?" "Would adjusting the ZIP code affect classification?" Explainability makes AI-based data remediation acceptable in regulated industries.
Artificial intelligence
fromTechzine Global
2 months ago

Sumo Logic launches data pipeline apps for Snowflake and Databricks

Snowflake offers a fully managed data platform, but Sumo Logic users often lack insight into performance, login activity, and operational health. The Sumo Logic Snowflake Logs App analyzes login and access activity to identify anomalies or suspicious behavior. It also optimizes data pipelines with insights into long-running or failing queries. Teams can centralize log data to facilitate correlation across applications, cloud services, and data platforms.
Information security
Data science
fromMedium
1 month ago

100 Scala Interview Questions and Answers for Data Engineers

Structured Scala and Apache Spark interview preparation requires understanding distributed systems, performance trade-offs, and pipeline design beyond theoretical knowledge.
fromMoz
2 months ago

Why Export GA4 Data to BigQuery?

Then coming on to the next point, which is you can create your own sessions and user properties. Now you can do this in the GA4 interface under Explorations.
Marketing tech
Artificial intelligence
fromMedium
2 months ago

Extracting AI-Ready Data From Organizational Documents

Poor document extraction corrupts retrieval; preserving document structure at ingestion produces reliable embeddings and trustworthy RAG outputs.
Artificial intelligence
fromTechzine Global
2 months ago

Snowflake launches Cortex Code agent for understanding data context

Cortex Code is an AI agent that converts complex data engineering, ML, and analytics tasks into natural-language workflows integrated into Snowflake and developer tools.
fromFast Company
1 month ago

Beware of data hubris

Organizations are drowning in dashboards, KPIs, performance metrics, behavioral traces, biometric indicators, predictive scores, engagement rates, and AI-generated forecasts. We have more data than we know what to do with. We pretend that the mere presence of data guarantees clarity. It does not. That's data hubris—the arrogant belief that because something can be measured, it can be mastered.
Business intelligence
Data science
fromInfoWorld
1 month ago

The revenge of SQL: How a 50-year-old language reinvents itself

SQL has experienced a major comeback driven by SQLite in browsers, improved language tools, and PostgreSQL's jsonb type, making it both traditional and exciting for modern development.
Data science
fromComputerworld
2 months ago

Tableau re-engineers dashboards, adds new analytics tools for business analysts

Tableau 2022.3 adds Data Guide and Table Extension, dynamic dashboards, event auditing, and performance/cost optimization to simplify self-service analytics for business users.
Artificial intelligence
fromTechRepublic
6 months ago

New AI Data 'Universal Translator' From Salesforce, Snowflake, Others

Snowflake and other firms created the Open Semantic Interchange to standardize semantics and enable interoperable data sharing among AI-enabled products, reducing semantic mismatches.
Data science
fromCIO
2 months ago

5 perspectives on modern data analytics

Data/business analytics is the top IT investment priority, yet analytics projects often fail due to poor data, vague objectives, and one-size-fits-all solutions.
fromMedium
2 months ago

Why "Data Scientist" is Becoming "AI Engineer" and What That Actually Means

The title "data scientist" is quietly disappearing from job postings, internal org charts, and LinkedIn headlines. In its place, roles like "AI engineer," "applied AI engineer," and "machine learning engineer" are becoming the norm. This Data Scientist vs AI Engineer shift raises an important question for practitioners and leaders alike: what actually changes when a data scientist becomes an AI engineer, and what stays the same? More importantly, what skills matter if you want to make this transition intentionally rather than by accident?
Artificial intelligence
fromTreehouse Blog
2 months ago

Portfolio Projects for Entry-Level Data Roles

Most beginner data portfolios look similar. They include: A few cleaned datasets Some charts or dashboards A notebook with code and commentary Again, nothing here is wrong. But hiring teams don't review portfolios to check whether you can follow instructions. They review them to see whether you can think like a data analyst. When projects feel generic, reviewers are left guessing:
Data science
Artificial intelligence
fromInfoWorld
2 months ago

Teradata unveils enterprise AgentStack to push AI agents into production

Teradata positions Enterprise AgentStack as a vendor-agnostic execution layer across hybrid environments, contrasting platform-tied AI approaches from Snowflake and Databricks.
Data science
fromInfoWorld
2 months ago

Snowflake debuts Cortex Code, an AI agent that understands enterprise data context

Cortex Code enables developers to use natural language to build, optimize, and deploy governed, production-ready data pipelines, analytics, ML workloads, and AI agents.
fromComputerWeekly.com
2 months ago

AI slop pushes data governance towards zero-trust models | Computer Weekly

Unverified and low quality data generated by artificial intelligence (AI) models - often known as AI slop - is forcing more security leaders to look to zero-trust models for data governance, with 50% of organisations likely to start adopting such policies by 2028, according to Gartner's seers. Currently, large language models (LLMs) are typically trained on data scraped - with or without permission - from the world wide web and other sources including books, research papers, and code repositories.
Artificial intelligence
Data science
fromInfoQ
2 months ago

Beyond the Warehouse: Why BigQuery Alone Won't Solve Your Data Problems

Data warehouses like BigQuery perform well initially but become slow, costly, and disorganized at scale, undermining low-latency operational use and innovation.
Data science
fromInfoQ
1 month ago

Databricks Introduces Lakebase, a PostgreSQL Database for AI Workloads

Databricks Lakebase is a serverless PostgreSQL OLTP database that separates compute from storage and unifies transactional and analytical capabilities.
Data science
fromTechzine Global
1 month ago

Ataccama puts agentic data observability into platform core

Ataccama ONE introduces Agentic Data Observability technology to ensure high-quality, reliable data for AI systems while preventing autonomous errors and bias in regulated enterprises.
Data science
fromComputerworld
2 months ago

Great R packages for data import, wrangling, and visualization

A set of R packages (dplyr, purrr, readr/vroom, datapasta, Hmisc) streamline data wrangling, importing, and analysis with faster, standardized, and reproducible tools.
[ Load more ]