Dharmesh Kakadia

Lead ML & Data at Microsoft

Curious. Currently working on ML in FinTech

Internals of Spark Parser

In this post we will try to demystify details about Spark Parser and how we can implement a very simple language with the use o...

Read the full article

Verifying links with Github actions & Awesome Bot

Recently I started using github action to automate link checking in all of my awesome repos. I have been using awesome_bot to v...

Read the full article

Versatile RStudio development environment on Kubernetes

R is very versatile language for data analysis and widely used for data science and exploration alongside python. RStudio is a ...

Read the full article

MXNet tools in docker

How to convert MXNet model to Apple CoreML: docker run -v "$PWD":/data --rm -it dharmeshkakadia/mxnet-coreml-tools-docker pyth...

Read the full article

Review - Are Ideas Getting Harder to Find?

This is a review of a recent paper Are Ideas Getting Harder to Find? by Charles I. Jones. Slides are also available. The centr...

Read the full article

Automate SQL server data pipelines with Kubernetes

Kubernetes provides a great way to run modern infrastructure. SQL server is a widely deployed database. When you combine these ...

Read the full article

Write a Presto query logging plugin

Presto is a fast distributed SQL query engine for big data. I wrote a more introductory and up and running post a while back. ...

Read the full article

Analyzing Azure Storage Performance

I work on performance of Big data systems at Azure HDInsight and as part of benchmarking, many times I need to analyze the perf...

Read the full article

OpenFaaS on Minikube

Minimal steps to run serveless/functions-as-a-service platform on Minikube. Start minikube. minikube start ...

Read the full article

Multi stage docker build for go

Support for multistage docker build has landed in Docker earlier this year. Multi stage builds simplify the image building and ...

Read the full article

Go Dep Example

I wanted to give Go’s new dependency management system — dep - a try. I searched for a minimal example and did not find one. So...

Read the full article

Presto on HDInsight

This article will explain presto internals and how to install presto on Azure HDInsight. If you are familiar with presto, you c...

Read the full article

Book review - Sapiens

Just read it.

Read the full article

Mesos Podcast with SEDaily

I did a podcast with Software Engineering Daily recently about Mesos, Kubernetes and infrastructure future. Do check it out and...

Read the full article

How to write a Hive Hook

Hive Hooks are little known gems that can be used for many purposes. In this post we will take a deeper look at what a Hive hoo...

Read the full article

Link to my old blog

I used to blog on blogger before I moved to github. Here is the link to my old blog : dharmeshkakadia.blogspot.com

Read the full article

Hello World

I am finally moving to jekyell thanks to Jekyll Now repository. I plan to gradually move my Blog and my personal web page here.

Read the full article

Compiling hive for a non-release hadoop version

We have been working on many interesting things around Perforator like extending the core model to other systems like hive, tez...

Read the full article

A great experience at Microsoft Summer School 2012

I had the opportunity of attending Microsoft Summer School on Distributed Algorithms, Systems, and Programming this summer. It ...

Read the full article