Portfolio
A collection of professional projects and academic work.
Professional Experience
Data Engineer
- Designed and implemented scalable data pipelines using AWS services including S3, Glue, Lambda, and Athena.
- Integrated third-party APIs and modeled datasets for ingestion into an organizational data lake.
- Built automated workflows on Databricks to ensure reliable and timely data delivery.
- Applied data quality checks, validation logic, and transformations prior to client consumption.
- Developed analytical dashboards by aggregating data across geographical and temporal dimensions.
- Managed relational and transactional workloads using AWS Aurora and optimized SQL queries.
Cloud Engineer
- Managed and supported AWS cloud infrastructure for enterprise data teams.
- Optimized processing workflows on AWS Redshift clusters, improving efficiency by approximately 20%.
- Designed system architecture and disaster recovery strategies to ensure high availability.
- Collaborated with cross-functional stakeholders to translate business requirements into technical solutions.
Academic Projects
Natural Language Interface to Query Institutional Database
- Developed a platform for faculty to query the university database using spoken English.
- Leveraged NLTK library to process the input string via POS tagging to generate an abstract syntax tree.
- Queried the structure using Apache TinkerPop to execute it against a graph database.
- The user-friendly interface simplified the insight derivation process for faculty to better understand student performance metrics.
Technical Skills
- Cloud Platforms: Amazon Web Services (AWS), Google Cloud Platform
- Data Tools: Databricks, AWS Glue, Athena, Redshift, SageMaker
- Programming: Python, PySpark, SQL
- Certification: Introduction to Generative AI (Coursera)