- I'm currently learning functional programming in Scala/Kotlin, designing data engineering applications, Google Cloud & Databricks
- I regularly write articles on Vitthal Mirji
- Check out my libraries published on Maven Central Repository:
  - Datapipelines Essentials: best-practice APIs/libraries for data engineering & data quality
  - DATA ENGINEERING & WAREHOUSING FUNCTIONAL PROGRAMMING EXTENSIONS: apply engineering methods to data, drawing on functional programming, & chain your algorithm steps with the power of Scala
  - SHC - BigTable-HBase Connector: Google BigTable with namespaces & name descriptors, bridging the gap
  - Nested Complex Data typed Data Parsing in Spark: a prototype library for deriving new attributes from XML when you have XPath transformations. Accelerates the boring stuff in #Pyspark & Python. Also check out how to handle multi-line XML
  - Vitthal Mirji Blog: my blog
- Ask me about Data Engineering, Machine Learning, Building Frameworks, Low-level Design Patterns, Data Warehousing, Java, Scala, Cats (still learning), Akka, Python, SQL and Kotlin
- Read about my experience: https://www.linkedin.com/in/vitthal10/
Staff Data Engineer @walmart
- Mumbai, India
- (UTC +05:30)
- https://www.vitthalmirji.com
- @whoami_vim
- in/vitthal10
Pinned
- datapipelines-essentials-python (Public)
  Simplified ETL process in Hadoop using Apache Spark. Has a complete ETL pipeline for a data lake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transform…
- Design-Patterns (Public)
  Design patterns provide a reusable solution to commonly occurring software problems.
  Java 1
- gcp-datalake (Public)
  Reads various types of files from Google storage, maps the data to Google Bigtable, and performs a bulk load into Google Bigtable.
  Scala 1
- MapReduceExamples (Public)
  Various MapReduce examples and algorithms, and Hadoop batch processing using MapReduce.
  Java 1
- hortonworks-spark/shc (Public)
  The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.


