YEO JIA QI
About Candidate
- Led migration of 2TB+ production data from ClickHouse to a distributed cloud data warehouse, reducing query latency by 25% and improving cross-system consistency.
- Designed and maintained batch + real-time ETL pipelines supporting near real-time BI and operational analytics.
- Built streaming ingestion pipelines (Flink → MaxCompute / Hologres) to power time-sensitive reporting workflows.
- Modeled analytics-ready fact and dimension tables in Hologres, enabling self-service Tableau dashboards for business users.
- Partnered with product and analytics teams to support A/B testing by designing experiment tracking schemas, validating metrics, and ensuring statistical data integrity.
- Reduced compute cost by 15% through SQL tuning, partition optimization, and resource monitoring automation (CPU/memory alerting).
- Developed Python-based data utilities and Redis-backed in-memory processing to improve pipeline throughput and latency.
- Implemented automated data validation and reconciliation checks to improve pipeline reliability.
Location
Education
B
Bachelor of Science with Honours in Decision Science
2020 - 2024
Universiti Utara Malaysia
Minor in Information Management.
FYP: Built ML-based sentiment analysis model (Python, NLTK, scikit-learn, VADER) on YouTube film review datasets.
Work & Experience
D
Data Engineer
APRIL 2025 - PRESENT
SNSoft Sdn Bhd
Built and optimized production-grade batch and streaming data pipelines to power BI dashboards and experiment analysis. Delivered measurable improvements in performance, cost efficiency, and data reliability while supporting A/B testing and analytics initiatives.
Q
QA Engineer (Intern)
MARCH 2024 - JULY 2024
BASSnet Sdn Bhd
Conducted functional and regression testing, validated deployment data accuracy, and partnered with developers to improve system stability and product quality.
