
"I wrote a book for O'Reilly on scaling machine learning with Spark specifically. My second book is coming out on how to improve high-performance Spark, the second edition. Started my career in the machine learning space 15 years ago, moved into data infrastructure, batch processing, and a year and a half ago I moved into the data streaming space, which I think it's what's going to help us pave the future in the data."
"Since I turned data 10 years ago, my then boss said, maybe you have a look at Spark. Wasn't the worst advice back then, 10 years ago now, looking at this. Yes, helping clients with data architectures, building data platforms, and especially interested because of my software engineering background in how to get data from operational systems towards an analytical system."
Adi Polak works for Confluent, building a data streaming platform and contributing to Apache Kafka, Apache Flink, and Apache Iceberg. He authored an O'Reilly book on scaling machine learning with Spark and is releasing a second edition on high-performance Spark. He began in machine learning 15 years ago, moved into data infrastructure and batch processing, and recently moved into data streaming. Sarah Usher transitioned from software engineering to data engineering and has worked across banking, law, insurance, ad tech, and developer security, focusing on scale, diverse datasets, and cultural challenges. Matthias Niehoff applies his software engineering background to build data platforms and move operational data toward analytical systems.
Read at InfoQ
Unable to calculate read time
Collection
[
|
...
]