About
Computer Engineering student specializing in data engineering, big data, and cloud-based analytics systems. Experienced in building ETL pipelines, dimensional data models, and scalable data platforms using Python, SQL, Spark, Kafka, AWS, and Snowflake.
Experience
-
Data EngineerFyre GigFeb 2025 – May 2026
Education
-
B.E. in Computer EngineeringThakur College of Engineering and Technology, MumbaiComputer Engineering · Jul 2022 – Jul 2026
Skills
- SQL
- Git
- Python
- Power BI
- Tableau
- MySQL
- Java
- JavaScript
- MongoDB
- Streamlit
- PostgreSQL
- AWS
- Operating Systems
- Docker
- Apache Kafka
- Apache Spark
- Kubernetes
- DBMS
- Terraform
- PySpark
- Databricks
- Snowflake
- ETL
- Looker
- DBT
- Azure
- Distributed Systems
- Apache Airflow
- Data Modelling
- GCP
- ELT
- Redis
- Apache Iceberg
- Apache Flink
- C++
- Data Structures & Algorithms
- Dimensional modelling
- Bigquery Sql
Projects
-
AI-Powered Banking Query AssistantPython, RAG, LLMs, Oracle
Agentic RAG over an Oracle database with provenance tracing and governance for explainable, auditable banking queries.
-
Multimodal RAG Document PipelinePython, RAG, Vector Search, LLMs
Vector-indexed retrieval over combined text and image content for grounded, source-cited question answering.
-
AWS Infrastructure AutomationTerraform, AWS, GCP
Reusable, parameterized Terraform modules for scalable, governed cloud provisioning with remote state management.
-
Centralized Logging PipelineNode.js, Kafka, Logstash, OpenSearch, Docker
Dockerized log-aggregation stack streaming Node.js logs through Kafka, Logstash, and OpenSearch with real-time dashboards for monitoring and search.
-
Music Data ETL PipelinePython, SQL, AWS Lambda, S3, Snowflake
Automated Spotify data ingestion into Snowflake via AWS Lambda and S3 staging, loading a star schema for scalable analytics.
-
Real-Time Streaming & Anomaly Detection PipelinePython, Kafka, SQL, Streamlit
Kafka-based pipeline for real-time log ingestion, anomaly detection, and AI-powered classification surfaced via Streamlit dashboards.
Courses & certifications
- Cybersecurity Analyst · Forage
- Spark with Databricks, Snowflake, Airflow · DataVidhya
- Data Streaming Engineer & Apache Flink® · Confluent
🏆 Achievements & awards
-
Medium articles recognized by Tim Berglund and Jason Hughes
Medium articles on Kafka, Airflow, Flink, and Iceberg were recognized by Tim Berglund, VP at Confluent, and Jason Hughes, Field CTO at Snowflake.