Spark Catalyst Optimizer and spark Expression basics


Table of contents Overview Trees Rules Expression CodegenFallback Example of spark native function using Unary expression Spark Catalyst Overview Core of Spark dataframe API and SQL queries. Supports cost based and rule based optimization. Built to be extensible : Adding new optimization techniques and features Extending the optimizier for custom use cases At core it uses trees On top of it various libraries are written for query processing, optimization and execution.…
Read more ⟶

Traffic flow in Kubernetes 101


Table of contents Intra pod communication Inter pod communication CNI plugin Pod to service communication Intra pod communication When pod is created, It is assigned to a node. A network namespace is created for it(i.e all of that pod’s containers belong to that namespace), IP is assigned to it. For every pod in the cluster there is pause container running in the background. The pause container creates and holds network namespace for that pod.…
Read more ⟶

GSoC 2020 Report


Introduction During my GSoC, I worked on App Store improvements project for ns-3 organization. GSoC was my first programming experience outside personal projects and I thoroughly enjoyed the experience. I had an awesome opportunity to work for the ns-3 organization. My mentors abhijithanilkumar, mishalshah and adeepkit01 were extremely responsive, helpful, and understanding. I would also like to thank tomh sir - the ns-3 Organization Admin for his help and suggestions.…
Read more ⟶

Integrating Elasticsearch 7 to Django project


Adding search ability to django using Elasticsearch
Read more ⟶