Liu Zhihao | Big Data Development Engineer | 5 Years of Experience
Hometown: Luzhou, Sichuan
Education: Bachelor's Degree
Age: 28
Phone: 177-6062-0226
Website: lzhgy.cn
Email: lzhgy163@163.com
Professional Summary
Big data developer with 5 years of experience, specializing in data warehouse construction, real-time data processing, and big data platform operations. Proficient in Java ecosystem tools, including Flink, ClickHouse, StarRocks, and mainstream databases (MySQL, Oracle, PostgreSQL). Led multiple core data projects involving trillion-level data migration and city safety early-warning systems.
Key Projects
Domestic: Maimeng Short Drama App (Available on app stores)
Global: NetShort App (Available on app stores)
Technical Skills
- Languages: Java (SpringBoot, MyBatis), Python, Shell
- Big Data: Flink, Kafka, Hive, HBase, StarRocks, ClickHouse, Maxwell, Seatunnel
- Databases: MySQL, PostgreSQL, Oracle, Redis
- Data Sync & ETL: FlinkCDC, Sqoop, Flume
- Scheduling & Monitoring: DolphinScheduler, Prometheus, Zookeeper
- Data Warehousing: Layer design (ODS/DWD/DWS/ADS), data quality audit, query optimization
- DevOps: Linux, server monitoring, data backup, disaster recovery
Work Experience
Nov 2024 - Present
Tech Stack: SpringBoot, MyBatis, Flink, StarRocks, Redis, HBase
- Built real-time user behavior analytics systems for decision-making.
- Designed Flink-based real-time data warehouse (ODS/DWD/DWS/ADS).
- Implemented CEP algorithms to detect security risks (e.g., 5 consecutive failed logins).
- Optimized search recommendations using IK Analyzer for hot-word analysis.
- Improved query efficiency via StarRocks data modeling and visualized business metrics.
Data Development Team Lead | ChengTou (Municipal Project)
Mar 2023 - Sep 2024
Tech Stack: Flink, ClickHouse, PostgreSQL, Kafka, Zookeeper, Redis, DolphinScheduler, Apisix
- Led city safety risk monitoring platform for government agencies, processing 2B+ daily sensor data.
- Optimized ClickHouse clusters (sharding/partitioning) across 130+ cloud servers.
- Managed trillion-level data migration to domestic infrastructure with zero consistency loss.
- Developed abnormal vehicle trajectory models (e.g., high-risk zones, prolonged stops).
- Achievement: Prevented 252 gas leakage incidents through real-time alerts.
Data Engineer | National Big Data Project
Oct 2021 - Jan 2023
Tech Stack: Nginx, Seatunnel, Kafka, MySQL, HBase, Redis, StarRocks, NebulaGraph
- Enhanced data flow stability for a classified big data platform.
Project Highlights
NetShort User Profile System
Architecture: SpringBoot + MyBatis + Spark + MySQL + StarRocks + Redis
- Designed tag-based user segmentation (demographics, consumption patterns).
- Developed Spark-based tag calculation workflows and stored results in StarRocks.
- Implemented user clustering via SpringBoot and Spark MLlib algorithms.
- Exported target user groups to Redis for personalized campaigns.
NetShort Real-Time Analytics System
Architecture: Nginx + Kafka + MySQL + HBase + Redis + Flink + StarRocks + Maxwell
- Built a 4-layer real-time warehouse (ODS→DWD→DWS→ADS) using Flink.
- Integrated dynamic data routing, CEP for login anomalies, and IK-based search analytics.
- Enabled real-time dashboards via StarRocks aggregation.
Architecture: Flink + ClickHouse + PostgreSQL + Kafka + Zookeeper + Redis + Apisix
- Processed 20B+ daily IoT sensor data with ClickHouse optimizations.
- Orchestrated ETL pipelines via DolphinScheduler and ensured 99.9% platform uptime.
- Achieved zero downtime during trillion-record migration to domestic infrastructure.
Self-Evaluation
- Proven expertise in end-to-end big data architecture design (0→1 systems).
- Strong capability in petabyte-scale data processing and real-time computing.
- Deep focus on data quality assurance and performance tuning.
- Experienced in cross-functional team leadership and project delivery.