Hi, my name is

Hema Harsha Vardhan Peela.

I'm a Data Engineer specializing in building robust, scalable data pipelines and transforming complex datasets into actionable business intelligence.

Check out my work!

About Me

Hello! I'm a passionate Data Engineer with a knack for turning raw data into meaningful stories. My journey into data began with a fascination for how structured information could solve real-world problems. Today, I have the privilege of designing, building, and maintaining data infrastructure that powers critical business decisions.

My experience includes redesigning cloud warehouse schemas to improve query performance by 40% and building dashboards that reduced quality escalations by 25%. I thrive in collaborative environments and am always eager to learn new technologies.

  • Python (PySpark)
  • AWS (Glue, Redshift)
  • GCP (BigQuery, Dataflow)
  • SQL & NoSQL
  • Airflow
  • Tableau & Power BI

Where I've Worked

Data & Cloud Engineer @ YourBook Team

Nov 2024 - May 2025

  • Redesigned cloud warehouse schemas (PostgreSQL, Redshift), boosting query performance by 40%.
  • Developed and maintained the systems behind the analytics infrastructure and data lake.
  • Built Power BI/SQL dashboards, reducing quality escalations by 25%.

Things I've Built

Sales Performance Optimization

An automated ETL pipeline and Tableau dashboard to track sales KPIs, improving forecast accuracy by 25%.

AWS Glue   Lambda   Python   Tableau

Android API Data Pipeline

Engineered a pipeline to collect, process, and store Android app data from APIs, enabling efficient data analysis and reporting.

Python   API   Data Pipeline

How Transformers Do Math

Explored how transformer models approach arithmetic and mathematical reasoning with code and experiments.

Python   Transformers   Machine Learning

Result Analysis Project

A web app that analyzes student performance and visualizes results to identify trends and support better decision-making.

ReactJS   Django   Data Visualization

Social Media Data Filtering

Developed a spam-filtering model with PySpark and scikit-learn, improving content quality by 45%.

PySpark   Scikit-learn   Pandas

Customer Segmentation

Implemented K-Means customer segmentation in R, contributing to a 30% reduction in customer churn.

R   K-Means   Machine Learning

What's Next?

Get In Touch

I'm currently seeking new opportunities, and my inbox is always open. Whether you have a question, a project idea, or just want to connect, I'll get back to you!

Say Hello