Professional Experience

Career journey, education, and professional development

6+ Years Experience
Total Experience
6+ Years

Professional development

Companies
2

Different organizations

Certifications
4

Active certifications

Team Leadership
8+

Developers mentored

Senior Data Engineer

EY

Hong Kong
June 2022 - Present
Full-time
3+ years

EY AI & Data is the AI and Data department within EY Asia-Pacific that provides consulting services in data architecture, strategy, analytics, AI and data platforms. I have been engaged with two major clients in multiple projects, providing consulting and project delivery services.

Key Achievements:

  • Built an Enterprise Data Analytics Platform for one of the biggest non-profit organizations in Hong Kong to serve the AI use-cases
  • Led a team of 12 off-shore data engineers in delivering a large-scale enterprise data analytics platform
  • Designed and implemented a production-grade custom Data Validation and Data Quality product and successfully promoted it into adoption from client stakeholders
  • Built real-time streaming pipelines in Spark Structured Streaming, enabling streaming analytics of MDM customer data for a client in the banking industry

Technologies Used:

DatabricksPySparkPythonSQLServerAzureSpark Structured Streaming

Notable Projects:

Enterprise Data Analytics PlatformInhouse ETL Framework360 Customer Data Mart
Data Scientist - Senior Data Scientist

JCDecaux

Paris, France
March 2019 - June 2022
Full-time
3 year 4 months

JCDecaux is a French MNC, known as the largest outdoor advertising company, operating in more than 80 countries in the world with a total headcount of 13,000 employees. As a Senior Data Scientist, I have worked in two of the most critical projects of the DataCorp department.

Key Achievements:

  • Lead the features definition, implementation and deployment of a product based on advertising visuals analysis with image processing and deep learning models, used by more than 20 business units across the world
  • Restructure critical legacy project of impressions estimation, resulting in a decrease of running cost and pipeline duration by more than 75%
  • Automate deployment of data solutions following standard SDLC practices using AWS CloudFormation, Gitlab CI/CD and containerization on AWS ECR and AWS ECS
  • Automate data pipelines on Managed Workflows for Apache Airflow through metadata-driven framework

Technologies Used:

ScalaApache SparkSETL FrameworkAWSDocker

Notable Projects:

Attention-based prediction modelAds reach estimationESG footprint assessment model