Iceberg + Kafka + Flink + Presto/Dremio : modern realtime data platformA good data platform has support for preprocessing as well as post processing of data basically ELT(Extract, Load and transform) and ETL…Feb 121Feb 121
DuckDB + Dbt + great expectations = Awesome Data pipelinesI have been using spark for majority of data pipelines and while I like Spark for orchestrating my data transformations, sometimes it can…Sep 12, 20233Sep 12, 20233
AI with PandasPandas revolutionalized the data analysis industry when it was introduced in 2008. To this day, it is actively used to do exploratory data…Aug 13, 2023Aug 13, 2023
Streaming Data PlatformMost of the companies today are racing to build their data systems in place. Gone are the days when the business was happy with reports and…Dec 4, 2022Dec 4, 2022
Data analytics platformToday in the 21st century data has become the new oil. Companies are scrambling today to get data from multiple sources, clean, transform…Jul 30, 2022Jul 30, 2022
MLOps: DevOps fancier cousinWe have got a new Sheriff in town. The big brother of DevOps…meet MLOps.Dec 25, 2020Dec 25, 2020
Deploy ML models using FlaskRecently I got a coding task to develop an end to end solution to find the most k words that occur most frequently in an arbitrary number…Nov 28, 2020Nov 28, 2020
ML EngineeringLately a new job description has been popping up quite a bit, the designation of ML Engineer. Who is this mythical creature and why is he…Nov 21, 2020Nov 21, 2020