YEO JIA QI

Data Engineer
July 30, 2000

About Candidate

  • Led migration of 2TB+ production data from ClickHouse to a distributed cloud data warehouse, reducing query latency by 25% and improving cross-system consistency.
  • Designed and maintained batch and real-time ETL pipelines supporting near-real-time BI and operational analytics.
  • Built streaming ingestion pipelines (Flink → MaxCompute / Hologres) to power time-sensitive reporting workflows.
  • Modeled analytics-ready fact and dimension tables in Hologres, enabling self-service Tableau dashboards for business users.
  • Partnered with product and analytics teams to support A/B testing by designing experiment tracking schemas, validating metrics, and ensuring statistical data integrity.
  • Reduced compute cost by 15% through SQL tuning, partition optimization, and resource monitoring automation (CPU/memory alerting).
  • Developed Python-based data utilities and Redis-backed in-memory processing to increase pipeline throughput and reduce latency.
  • Implemented automated data validation and reconciliation checks to improve pipeline reliability.

Education

Bachelor of Science with Honours in Decision Science 2020 - 2024
Universiti Utara Malaysia

Minor in Information Management.

Final Year Project (FYP): Built an ML-based sentiment analysis model (Python, NLTK, scikit-learn, VADER) on YouTube film review datasets.

Work & Experience

Data Engineer APRIL 2025 - PRESENT
SNSoft Sdn Bhd

Built and optimized production-grade batch and streaming data pipelines to power BI dashboards and experiment analysis. Delivered measurable improvements in performance, cost efficiency, and data reliability while supporting A/B testing and analytics initiatives.

QA Engineer (Intern) MARCH 2024 - JULY 2024
BASSnet Sdn Bhd

Conducted functional and regression testing, validated deployment data accuracy, and partnered with developers to improve system stability and product quality.

Skills

Python: 67%
SQL: 90%
Java: 65%
ETL: 65%
Data Engineering: 77%