Recap of a talk at Microsoft Research Summit 2021 by Arun Iyer on the challenges of building a DEEGO graph learning solution for search and recommendation problems.
Why 90% AI/ML projects not productionized? Enable Production ML with Databricks Feature Store (Databricks Data+AI Summit 2022) – Part 2
Why 90% AI/ML projects not productionized? Enable Production ML with Databricks Feature Store (Databricks Data+AI Summit 2022) – Part 1
Why are 90% of your AI/ML projects not making it to production? Enable Production ML with Databricks Feature Store (Databricks Data+AI Summit 2022) - Part #1
Beyond Monitoring: The Rise of Data Observability (Databricks Data+AI Summit 2022)
This post recaps the ideas from a brilliant session hosted by Barr Moses, CEO and co-founder of Monte Carlo Data, at Data+AI Summit 2022. In this session, Barr shared her insights about data observability and presented a solution to the problems that most data engineering teams face today.
Apache Spark on Kubernetes—Lessons Learned from Launching Millions of Spark Executors (Databricks Data+AI Summit 2022)
This article summarizes an exciting session hosted by Zhou Jiang and Aaruna Godthi from Apple at Data+AI Summit 2022. In this session, Zhou and Aaruna talked about how they built a centralized Apache Spark cluster on Kubernetes that processes 380K+ Spark jobs per day to support analytics workflows and scientists' experimentation at Apple.
MLOps at DoorDash (Data+A.I. summit 2022)
This post is a summary of an insightful tech talk by Hien Luu, Head of ML Platform at DoorDash, about DoorDash's MLOps infrastructure, motivation, strategy, and learnings.
How AI increases drive-through sales for Starbucks (Data+A.I. summit 2022)
This post summarizes a great talk at Data+A.I. summit 2022 on how Starbucks implements a reinforcement learning solution to improve its sales in the drive-through scenario.
Understanding Spark Connect (Reynold Xin’s keynote on Data+AI Summit 2022)
"Spark Connect" creates a "thin client" that enables Spark query capability on low-compute devices, with a re-architected Spark driver that gets around some shortcomings of the monolithic driver and better supports multi-tenancy.
LeetCode – Max Points on a Line Solution (Python, Java)
LeetCode - Max Points on a Line Solution
Managing DocumentDB(CosmosDB) stored procedure – Partitioning, Scaling and Limits
What is a stored procedure for NoSQL DocumentDB (CosmosDB)? Similar to a traditional relational database, DocumentDB (CosmosDB) provides a way to perform multiple operations atomically, at document level. If you ask me what the biggest difference between MongoDB and DocumentDB is, my answer would be the stored procedures provided in DocumentDB. However, there are a couple of things …







