CJ Sanon

CJ Sanon

Mentor
Rising Codementor
US$8.00
For every 15 mins
ABOUT ME
Data Engineer with experience scaling up startups
Data Engineer with experience scaling up startups

I am a Senior Data Engineer with experience in optimizing reports, developing ETL pipelines, and leading data team strategies. Skilled in Python, SQL, AWS, Docker, and data visualization tools.

English
Eastern Time (US & Canada) (-04:00)
Joined March 2025
EXPERTISE
5 years experience
5 years experience
4 years experience
4 years experience
4 years experience

REVIEWS FROM CLIENTS

CJ's profile has been carefully vetted and approved as a Codementor. Connect with CJ now, and leave a review for them once you're done!
SOCIAL PRESENCE
GitHub
S3-Data-Lake-ETL-Spark
An ETL pipeline using Spark to transform data from a data warehouse to a data lake.
Jupyter Notebook
1
0
CJSanon
Config files for my GitHub profile.
0
0
EMPLOYMENTS
Senior Data Engineer
NAVI
2023-05-01-Present

• Redesigned and optimized critical reports by replacing inefficient R scripts with production-grade Python and SQL, reducing run time...

• Redesigned and optimized critical reports by replacing inefficient R scripts with production-grade Python and SQL, reducing run times by up to 96%
• Collaborated with the data analytics team to develop new reports and reporting capabilities, securing additional clients and driving a 60% revenue increase
• Designed and led data team strategy for the large-scale migration of ETL pipelines from disparate in-house solutions to a Docker-based Dagster deployment, significantly improving data visibility, enabling detailed lineage tracking, accelerating metadata discovery for faster troubleshooting, simplifying development lifecycle, and standardizing coding practices across all environmentsData Engineer | May 2023 - Jan. 2025

Python
SQL
Docker
View more
Python
SQL
Docker
AWS
View more
Senior Data Engineer Consultant (Part-Time Contract)
HAYSTACK SEARCH
2021-10-01-Present

• Built and maintained Python and SQL-based ETL pipelines that ingest and clean over 25 million business records for a machine learnin...

• Built and maintained Python and SQL-based ETL pipelines that ingest and clean over 25 million business records for a machine learning application, ensuring data accuracy and scalability
• Deployed Docker-based Airflow to orchestrate Python and SQL scripts for data ingestion and cleansing in user search results, and created comprehensive documentation on setup, usage, and troubleshooting

Python
SQL
Docker
View more
Python
SQL
Docker
Airflow
View more
Data Software Engineer
SPYCLOUD
2023-06-01-2024-06-01

• Partnered with the internal cybercrime research team to scale breach data ingestion by updating legacy data systems for parsing auto...

• Partnered with the internal cybercrime research team to scale breach data ingestion by updating legacy data systems for parsing automation, achieving a 2–3x increase in ingest velocity
• Ingested over 100 million breach records across multiple data stores by parsing data with Linux, Bash, Python, and SQL
• Improved and maintained large-scale data ingestion pipelines—handling hundreds of millions of daily records—and provided on-call support to ensure reliable delivery for both internal analytics and external business products

Python
SQL
Linux
View more
Python
SQL
Linux
Bash
AWS
View more