Data science

from InfoWorld
1 year ago

Snowflake updates developer tools, adds observability features

Snowflake Trail adds observability features that let developers monitor data quality, pipelines, and applications, which helps them optimize workflows and troubleshoot problems.
Data science
#data-analytics
from Medium
4 days ago
Data science

The Data Science Playbook: Exploring Sports Analytics Through Real Datasets

#data-management
Data science
from InfoWorld
1 month ago

Building an analytics architecture for unstructured data and multimodal AI

Organizations must adapt data pipelines for scalability and consistency to leverage AI effectively.
Flexible data preparation processes are essential for managing unstructured data and evolving with business needs.
#data-integration
from Hackernoon
1 year ago
Data science

A Developer's Guide to SeaTunnel and Hive Integration with Real-World Configs | HackerNoon

from Hackernoon
2 years ago

Why No Single Algorithm Solves Deduplication - and What to Do Instead | HackerNoon

Effective blocking dramatically cuts the number of comparisons while still grouping true duplicates together. Several blocking strategies can be applied in multiple passes to improve recall.
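The multi-pass idea in this summary can be sketched in a few lines of Python. This is a minimal illustration, not the article's implementation: the records, the two blocking keys (`by_zip`, `by_name_prefix`), and the helper `candidate_pairs` are all hypothetical names invented for the example.

```python
from collections import defaultdict
from itertools import combinations


def candidate_pairs(records, key_funcs):
    """Return the union of within-block index pairs over several blocking passes."""
    pairs = set()
    for key_func in key_funcs:
        # One pass: group record indices by this pass's blocking key.
        blocks = defaultdict(list)
        for idx, rec in enumerate(records):
            key = key_func(rec)
            if key:  # skip records with no usable key in this pass
                blocks[key].append(idx)
        # Only records that share a block are compared with each other.
        for members in blocks.values():
            pairs.update(combinations(members, 2))
    return pairs


# Hypothetical toy records, made up for illustration.
records = [
    {"name": "Jon Smith", "zip": "10001"},
    {"name": "John Smith", "zip": "10001"},
    {"name": "Jon Smyth", "zip": "94105"},
    {"name": "Alice Jones", "zip": "94105"},
]


def by_zip(rec):
    return rec["zip"]


def by_name_prefix(rec):
    return rec["name"][:3].lower()


pairs = candidate_pairs(records, [by_zip, by_name_prefix])
total = len(records) * (len(records) - 1) // 2  # all-pairs comparison count
print(f"{len(pairs)} candidate pairs instead of {total}")
```

The zip-code pass alone never compares records 0 and 2; the name-prefix pass adds that pair, which is how a second pass can raise recall while the total stays below the all-pairs count.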
Data science
from InfoWorld
1 year ago

What's new in MySQL 9.0

MySQL 9.0.0 introduces a new Vector datatype, JavaScript Stored Programs, updated library versions, and enhancements to the Event Scheduler, while deprecating old SHA-1 security.
Data science
#data-quality
from TechCrunch
1 week ago
Data science

AI is forcing the data industry to consolidate - but that's not the whole story | TechCrunch

#open-source
from InfoQ
1 week ago
Data science

Databricks Contributes Spark Declarative Pipelines to Apache Spark

#data-visualization
from Hackernoon
4 years ago

What If Your 'Messy' Data Is Actually Perfect? | HackerNoon

The Success Metrics layer transforms a vision from aspiration to action by defining what success looks like and how we'll know when we've achieved it.
Data science
#model-evaluation
#data-processing
#data-analysis
from ESPN.com
2 weeks ago
Data science

NHL draft grades: From the excellent (Islanders, Hurricanes) to the confusing (Maple Leafs)

from Medium
1 month ago

Frequent Spark Interview Questions Part 2

Both cache() and persist() store an RDD/DataFrame/Dataset in memory (or on disk) to avoid recomputation. For RDDs, cache() is shorthand for persist(StorageLevel.MEMORY_ONLY) (DataFrames and Datasets default to MEMORY_AND_DISK), while persist() lets you choose the storage level.
Data science
from DevOps.com
2 weeks ago

DataOps and Automation: The Future of Database Management - DevOps.com

"The opportunity cost of inaction is steep. Research from Forrester shows that 31% of companies cite an inability to adapt to market or competitive pressure due to data management challenges."
Data science
from Theregister
2 weeks ago

A trip through vintage datacenter networking

Mainframe manufacturers defined their own proprietary network protocol stacks, e.g., IBM System Network Architecture, Digital's DECNet. These generally ran over leased lines between datacenters.
Data science
from Medium
3 weeks ago

RDD vs DataFrame vs Dataset in Apache Spark: Which One Should You Use and Why

Spark offers three main APIs—RDD, DataFrame, and Dataset—each with unique advantages: RDD provides low-level control, DataFrames optimize performance, and Datasets bring type safety.
Data science
#ai
from www.theguardian.com
2 weeks ago

Antarctic ice has grown again but this does not buck overall melt trend

Antarctic ice gained mass from 2021 to 2023, showing climate change follows a jagged path with temporary gains amid long-term losses.
Data science
#snowflake
#software-engineering
#artificial-intelligence
from Nature
3 weeks ago
Data science

Medical AI can transform medicine - but only if we carefully track the data it touches

from Nature
3 weeks ago

Will Gates and other funders save massive public health database at risk from Trump cuts?

"Ending the DHS would be catastrophic," says Peter Macharia, a spatial epidemiologist from Kenya, now at the Institute of Tropical Medicine in Antwerp, Belgium. Macharia says his PhD on child health interventions was based entirely on DHS data from Kenya. "Where would we get our new statistics from? We would not know what is happening in terms of health in the communities and the needs in each area," he says.
Data science
from 24/7 Wall St.
3 weeks ago

Snowflake (NYSE: SNOW) Price Prediction and Forecast 2025-2030 (June 2025)

Shares of Snowflake Inc. surged 6.56% in the past month, achieving a year-to-date gain of 70.82%, with Q1 revenue exceeding $1 billion for the first time.
Data science
from WIRED
3 weeks ago

India Is Using AI and Satellites to Map Urban Heat Vulnerability Down to the Building Level

Remote-sensing data and AI are being utilized to identify heat-vulnerable buildings in cities like Delhi, targeting efforts to provide relief during extreme temperatures.
Data science
from Hackernoon
1 year ago

Are Judeo-Christian Values the Foundation of American Democracy? | HackerNoon

There are some that claim the US Constitution is a product of a Judeo-Christian culture, asserting that democracy matured due to a Christian influence.
Data science
from www.npr.org
3 weeks ago

Greetings from Shenyang, China, where workers sort AI data in 'Severance'-like ways

Cities like Shenyang, once reliant on declining industries, are reinventing themselves by focusing on new tech initiatives, particularly in AI data processing to create new jobs.
Data science
from Talkpython
3 weeks ago

10 Polars Tools and Techniques To Level Up Your Data Science

There are many benefits to Polars directly of course.
Data science
from Los Angeles Times
3 weeks ago

'We are still here, yet invisible.' Study finds that U.S. government has overestimated Native American life expectancy

The findings of this study reveal that systemic misclassification is further exacerbating existing health disparities, leading to a tragic underrepresentation of the true mortality rates for American Indian and Alaska Native individuals.
Data science
from Business Insider
4 weeks ago

Data centers' environmental impact is hard to quantify. Here's how we did it.

Tech companies are investing hundreds of billions into data centers for AI, but the environmental and economic costs are largely unaccounted for, raising critical questions.
Data science
from eLearning Industry
1 month ago

Data-Driven L&D: Building Real-Time Learning Analytics Dashboards With No-Code

In today's hyper-digital workplace, the shift from traditional training methods to real-time insights through no-code analytics dashboards is revolutionizing Learning and Development.
Data science
from Hackernoon
4 weeks ago

The Data Science Behind r/antiwork's Upvotes | HackerNoon

The dataset for our analysis was shaped by filtering out potentially biased comments, ensuring that the final set was representative and valid for our study.
Data science
from The Verge
1 month ago

Google has a new AI model and website for forecasting tropical storms

Google's new AI model forecasts tropical cyclones more accurately than traditional models, promising improved storm tracking and preparation.
#machine-learning
from Hackernoon
1 month ago

Why Data Lies (and Your Model Might Too): The Curious Case of Simpson's Paradox | HackerNoon

The conditional probability P(Admit | Female, Dept) is higher than P(Admit | Male, Dept) in Department A, but that advantage gets wiped out when we aggregate everything.
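A small numeric sketch makes the reversal concrete. The admission counts below are hypothetical, chosen only to reproduce the pattern described above; they are not the article's data, and the names `counts`, `per_dept`, and `overall` are invented for this example.

```python
# Hypothetical (admitted, applied) counts per department and gender.
counts = {
    ("A", "female"): (80, 100),
    ("A", "male"): (480, 800),
    ("B", "female"): (180, 900),
    ("B", "male"): (20, 200),
}

# Conditional admission rate P(Admit | gender, dept) for every cell.
per_dept = {key: admitted / applied for key, (admitted, applied) in counts.items()}


def overall(gender):
    """Aggregate admission rate P(Admit | gender), pooling all departments."""
    admitted = sum(a for (_, g), (a, _n) in counts.items() if g == gender)
    applied = sum(n for (_, g), (_a, n) in counts.items() if g == gender)
    return admitted / applied


for (dept, gender), rate in sorted(per_dept.items()):
    print(f"Dept {dept}, {gender}: {rate:.2f}")
print(f"Overall female: {overall('female'):.2f}, male: {overall('male'):.2f}")
```

In these made-up numbers women have the higher rate inside each department, yet the lower rate overall, because most women applied to the department with the lower admission rate. The pooled figure mixes two different application patterns, which is the core of Simpson's paradox.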
Data science
from Business Matters
1 month ago

Mostly AI launches $100k global challenge to spotlight privacy-safe synthetic data for AI development

"Open data access is key to unlocking AI's full potential - but achieving that will require wider adoption of synthetic data tools."
Data science
from www.npr.org
1 month ago

How a dog aging project can help pets and humans live healthier lives

The Dog Aging Project aims to uncover health trends in dogs to improve their longevity and gain insights applicable to human health.
Data science
from www.theguardian.com
1 month ago

Alzheimer's blood test can spot people with early symptoms, study suggests

A new blood test can accurately diagnose Alzheimer's with high sensitivity and specificity, suggesting a major advance in early detection.
Data science
from ZDNET
1 month ago

The hidden data crisis threatening your AI transformation plans

Siloed data limits holistic understanding, especially for AI applications.
Data science
from FlowingData
1 month ago

Professor who studied honesty loses tenure over faked data

Francesca Gino lost Harvard tenure due to allegations of data falsification, despite her extensive research on honesty.
from Hackernoon
9 months ago

How GitHub and Stack Overflow Data Were Verified for Research Accuracy | HackerNoon

To enhance construct validity, we implemented strategies such as pilot experiments for data labelling agreements and consensus involvement to mitigate personal bias.
Data science