Career journey, education, and professional development
6+ Years Experience
Total Experience
6+ Years
Professional development
Companies
2
Different organizations
Certifications
4
Active certifications
Team Leadership
8+
Developers mentored
Senior Data Engineer
EY
Hong Kong
June 2022 - Present
Full-time
3+ years
EY AI & Data is the AI and Data department within EY Asia-Pacific that provides consulting services in data architecture, strategy, analytics, AI and data platforms. I have been engaged with two major clients in multiple projects, providing consulting and project delivery services.
Key Achievements:
Built an Enterprise Data Analytics Platform for one of the biggest non-profit organizations in Hong Kong to serve the AI use-cases
Led a team of 12 off-shore data engineers in delivering a large-scale enterprise data analytics platform
Designed and implemented a production-grade custom Data Validation and Data Quality product and successfully promoted it into adoption from client stakeholders
Built real-time streaming pipelines in Spark Structured Streaming, enabling streaming analytics of MDM customer data for a client in the banking industry
Enterprise Data Analytics PlatformInhouse ETL Framework360 Customer Data Mart
Data Scientist - Senior Data Scientist
JCDecaux
Paris, France
March 2019 - June 2022
Full-time
3 year 4 months
JCDecaux is a French MNC, known as the largest outdoor advertising company, operating in more than 80 countries in the world with a total headcount of 13,000 employees. As a Senior Data Scientist, I have worked in two of the most critical projects of the DataCorp department.
Key Achievements:
Lead the features definition, implementation and deployment of a product based on advertising visuals analysis with image processing and deep learning models, used by more than 20 business units across the world
Restructure critical legacy project of impressions estimation, resulting in a decrease of running cost and pipeline duration by more than 75%
Automate deployment of data solutions following standard SDLC practices using AWS CloudFormation, Gitlab CI/CD and containerization on AWS ECR and AWS ECS
Automate data pipelines on Managed Workflows for Apache Airflow through metadata-driven framework
Technologies Used:
ScalaApache SparkSETL FrameworkAWSDocker
Notable Projects:
Attention-based prediction modelAds reach estimationESG footprint assessment model