About
Data Science Manager with over 7 years of experience developing and scaling AI/ML solutions, advanced analytics, and data architecture in the pharmaceutical and financial industries. Track record of driving business insights, automating processes, and delivering high-impact projects, including a 50% increase in operational efficiency and the acquisition of 7 million new customers. Experienced in applying large language models, NLP, and cloud platforms to turn complex data into actionable strategies and production-ready products, with strong recognition from top management.
Work
MSD China
Data Science Manager, Advanced Analytics Team
Shanghai, Shanghai, China
Summary
Leading the development and scaling of AI solutions, from proof-of-concept to production, for a top global pharmaceutical company, driving innovation and business impact.
Highlights
Developed an AI solution POC for social media data using RAG, NLP, and machine learning, transforming raw data into an interactive web application and earning strong recognition from top management.
Designed and scaled the AI architecture, leading a 2-engineer team to turn a critical POC into a production-ready product using FastAPI, AWS native services (RDS, Glue, Lambda, S3), CI/CD, and Next.js.
Automated business insights by leveraging generative AI for Text2SQL queries and dashboard creation, significantly improving data analysis and decision-making speed.
Fostered internal innovation and broader AI adoption by collaborating with international partners to evaluate AI's impact across pharmaceutical and commercial projects.
PwC Info. Technologies
Senior Data Scientist, AI & Data Analytics Team
Shanghai, Shanghai, China
Summary
Led generative AI projects and developed AI-based platforms for clinical trial automation, delivering customized machine learning solutions for global clients.
Highlights
Led junior data scientists in China and India in developing question answering systems for a range of business scenarios, using Large Language Models, LangChain, and vector databases.
Developed an AI platform to automate the clinical trial lifecycle for major US pharmaceutical companies, implementing NLP techniques (NER, relation extraction) with BERT, AWS, and Docker.
Designed customized solution architectures and applied machine learning algorithms for anomaly detection and user conversion prediction, collaborating cross-functionally in Scrum teams to accelerate sales.
Organized and led junior coworkers in preparing POC demonstrations for business pitches, including Generative AI and recommender systems, and regularly shared technical knowledge across diverse audiences.
Tencent
Algorithm Engineer, Risk Management and Analytics Center
Shenzhen, Guangdong, China
Summary
Drove high-impact anomaly detection projects for over 1 billion users and delivered end-to-end data solutions for overseas products, significantly enhancing operational efficiency.
Highlights
Led a high-impact anomaly detection project covering over 1 billion WeChat Pay customers, applying social network analysis, deep learning, machine learning, and NLP; the project was recognized for its successful implementation.
Delivered comprehensive end-to-end data solutions for overseas products, covering data wrangling, model building, strategy development, and automated deployment and visualization.
Collaborated with cross-functional teams to enhance in-house products for qualitative review, resulting in a 50% increase in operational efficiency.
Coordinated with external consultants and presented to overseas partners in support of acquiring a payment license in Europe.
Agricultural Bank of China
Data Scientist, Data Science Team
Shanghai, Shanghai, China
Summary
Leveraged statistical analysis and machine learning to identify target customers, develop fraud detection models, and establish robust tracking systems, driving significant customer acquisition and growth.
Highlights
Used statistical analysis and machine learning models to identify 150 million target customers, leading to the acquisition of 7 million new customers.
Developed scorecard models for credit evaluation and fraud detection, balancing risk, user experience, and revenue, and collaborated cross-functionally to validate and automate these models.
Created robust tracking and reporting systems to monitor and analyze key metrics, regularly presenting data-driven insights and recommendations to board members.
Provided comprehensive data analysis training to coworkers nationwide, fostering a data-driven culture and empowering colleagues with valuable analytical skills.
Education
Uppsala University
Master of Science
Statistics
Courses
Full Scholarship Recipient (CSC Scholarship and Uppsala Global Merit Scholarship)
Central University of Finance and Economics
Bachelor of Economics
Statistics
Courses
Excellent Student; Second-Class All-round Development Scholarship Recipient
Languages
Chinese (Native)
English (Fluent)
Certificates
Azure AI Engineer Associate
Issued By
Microsoft
Skills
Programming Languages
Python, R, SAS.
Data Processing & Analytics
PySpark, Linux, SQL, NLP, Machine Learning, Deep Learning, Statistical Analysis, Generative AI, Recommender Systems, Knowledge Graphs, Anomaly Detection, User Conversion Prediction, Text2SQL, Dashboard Creation, Data Wrangling, Data Visualization, Data Analysis, Metric Monitoring.
Cloud Platforms
AWS native services: SageMaker, S3, EC2, Glue, Lambda.
Databases
Hive, MySQL, PostgreSQL.
MLOps & DevOps
Docker, CI/CD, FastAPI.
Web Frameworks
Next.js.
Business Intelligence Tools
Xiaoma BI (Tableau-like tool).
AI/ML Frameworks & Models
Large Language Models (LLMs), LangChain, BERT, Retrieval-Augmented Generation (RAG), Vector Databases.