Charan Tatineni

CHARAN TATINENI

DATA ENGINEER • INFRASTRUCTURE • DEVOPS • PLATFORM

AWS AWS Databricks Databricks Apache Airflow Airflow Apache Kafka Kafka Apache Spark PySpark Amazon Redshift Redshift Terraform Terraform Kubernetes Kubernetes Docker Docker CI/CD
🟢 OPEN TO WORK
VIEW WORK
DATA ENGINEERING /// CLOUD ARCHITECTURE /// PIPELINES /// SPARK /// KAFKA /// AWS /// DATABRICKS /// DATA ENGINEERING /// CLOUD ARCHITECTURE /// PIPELINES /// SPARK /// KAFKA /// AWS /// DATABRICKS ///

WORK EXPERIENCE

GoodRx

GOODRX

Data Engineer

📍 Santa Clara (Remote), CA

Aug 2025 - Present

1B+ Records/Week
  • Zero-ETL Ingestion: Reduced data latency from hours to minutes using Kafka streaming
  • Medallion Architecture: Architected trusted data marts with schema guarantees and freshness SLAs
  • Infrastructure as Code: Provisioned AWS/Databricks via Terraform with CI/CD pipelines
PySpark Airflow Databricks Kafka AWS Terraform

HYPERWATER.AI

Data Engineer

📍 Dallas, TX

May 2024 - Aug 2025

99.99% Data Availability
  • Ingestion APIs: Built REST services enforcing schema contracts and idempotency
  • Monitoring Pipeline: Reduced incident detection by 40% with CloudWatch integration
  • CI/CD Automation: Maintained 15-min freshness SLAs for critical dashboards
Python AWS S3 CloudWatch Codefresh ETL/ELT
George Mason University

GEORGE MASON UNIVERSITY

Database Administrator

📍 Fairfax, VA

Sep 2023 - May 2024

4K+ Student Records • 100+ Data Points
  • Healthcare ETL: Automated Python pipelines for research data normalization
  • Cloud Infrastructure: Administered hybrid cloud environment
  • Performance Optimization: Accelerated data availability for clinical teams
Python ETL SQL Cloud

TECHNICAL ARSENAL

Apache Spark

DATA ENGINEERING

Python SQL Java PySpark (Spark SQL) Databricks Delta Lake Iceberg Medallion Architecture
AWS

CLOUD & INFRA

AWS S3 Redshift EMR Kafka (Confluent) DynamoDB CloudWatch IAM

ORCHESTRATION & DEVOPS

Apache Airflow Astronomer DBT Terraform GitHub Actions Docker Monte Carlo Fivetran Looker

PROJECTS GRID

Apache Beam - London Cycles ETL
GCP Dataflow

Engineered scalable ETL pipelines on public datasets to visualize high-traffic cycling routes via Google Cloud SDK.

Apache Spark - 27M Movie Engine
AWS EMR / PySpark

Cosine Similarity engine processing 27M+ ratings to serve sub-second "Top 10" recommendations.

Apache Kafka - Market Simulator
Real-Time Streaming

Fault-tolerant streaming architecture simulating high-frequency stock data via Kafka Producers -> S3 Data Lakes.

Cyclistic Strategy Study
Data Analytics

End-to-end strategy proposal based on rider usage patterns using Pandas & Seaborn visualization.

TaskJarvis Automation
Python CLI

Custom OS-level productivity agent for automated scaffolding and file structuring.

Internet of Things Based UV Automobile Sanitization
Research Paper

Conference: 2021 International Conference on Smart Electronics and Communication (ICOSEC). DOI: 10.1109/ICOSEC51865.2021.9591934.

Citations: 3+ • References: 11+

EDUCATION

George Mason University

GEORGE MASON UNIVERSITY

📍 Fairfax, VA

M.S. Computer Science

GPA: 3.73/4.0

Databases • Distributed Systems • Big Data • ML • Algorithms

🎓

JNTU-K INDIA

B.S. Computer Science

GPA: 3.5/4.0

Software Engineering • Object-Oriented Programming • Systems Programming • API Design • Algorithms

REAL-TIME PROCESSING /// SCHEMA VALIDATION /// CI/CD /// MONITORING /// DATA QUALITY /// SLA MANAGEMENT /// REAL-TIME PROCESSING /// SCHEMA VALIDATION /// CI/CD /// MONITORING ///

LET'S BUILD

Gmail

EMAIL

charantatineni11@gmail.com

LinkedIn

LINKEDIN

linkedin.com/in/charantatineni

GitHub

GITHUB

github.com/charantatineni

HTML5

PORTFOLIO

tatineni.dev

STATUS: ● OPEN TO OPPORTUNITIES

LET'S TALK DATA