About meHere's my story.
I’m Yun Zhou, a software engineer, who is passionate about building and supporting the fundamental data infrastructure across distributed systems. My mission is to orchestrate and transform massive volumes of real-time data into AI/ML-ready data warehouses in the cloud, enabling advanced AGI and LLM development.
I earned my master's degree in information science from the College of Computing and Information Science (CIS) at Cornell University. After graduation, I joined a tech unicorn startup, Flexport, as a software engineer. In the beginning, I built and maintained the CI/CD pipelines for both the monolithic application and microservices, improved the local development experience in k8s environment for GraphQL Gateway, and facilitated several devops migration projects.
Now, I am working on the infrastructure team to manage the lifecycle of real-time data, scaling to petabytes, including data ingestion and transformation using Kafka, dbt, and Snowflake. I build and maintain data pipelines for our cloud data warehouse, supporting data scientists and analysts in implementing AI and machine learning models, as well as creating dashboards for business intelligence. Additionally, I am leading several migration projects to consolidate tooling within the company, aiming to enhance the developer experience and improve operational outcomes.
In my graduate career, I worked as a full stack software engineer in the Department of Computational Biology at Cornell where I collaborated with designers and other software engineers to construct an AI/ML-ready database for molecular epidemiology of Mtb, and implement publicly available web services. Meanwhile, I interned at a start-up, Intern Guys, as a backend developer to implement the Recruiter Portal and database based on .NET framework, aiming to build a streamlined internship recruiting process for both candidates and employers.
Prior to Cornell, I worked full-time as a business analyst for four years at a leading health care provider and insurance company in the state of Minnesota in the United States, HealthPartners, where I architected and implemented inventory and finance databases for the pharmacy department, designed and developed hundreds of dashboards and reports for over 50 retail pharmacies, clinics, and hospitals, and scripted advanced SQL to retrieve and integrate Electronic Health Record (EHR) from enterprise data warehouse to assist not only healthcare leaders to understand financial and operational pictures of the organization, but also doctors and clinical managers to help millions of patients.
Before HealthPartners, I studied statistics in my undergraduate career at the University of Minnesota-Twin Cities where I made my foray into statistical theories, supervised and unsupervised machine learning algorithms, predictive data analytics, and so on. Outside the classroom, I took statistics and analytics internships in a variety of places, such as State of Minnesota, Real Avid, Minnesota Population Center, and Flatrate Moving. In my spare time, I enjoyed volunteering and tutoring mathematics and statistics in multiple places, with the goal of fostering leadership and academic success in teenagers (Youthrive) and minorities in the STEM fields (MCAE, CommonBond Communities).
In my early career, I have experienced multidisciplinary jobs in diverse industries, and found my true passion lying in converting business into AI solutions that can help people live a better life. In my spare time, I love to meet new people and hear their stories and insights on the tech industry. Please do not hesitate to connect with me on LinkedIn. You can find my past projects on GitHub or Projects.
Work Experience
In the most recent five years