Professional Summary
Data Science student with a solid foundation in statistics and hands-on experience in geospatial analysis, LLM agent development, NLP, and network science. Proven ability to build end-to-end data pipelines, evaluate spatial reasoning models, and apply complex stochastic and geographic modeling to real-world datasets. Multilingual (Chinese, English, Japanese) with cross-functional research and internship experience in Japan and Hong Kong.
Technical Skills
Python (NumPy, Pandas, PyTorch, NetworkX)
R & Java
Geospatial Analysis (QGIS)
LLM/LMM Applications
Network Science & Data Pipelines
Software (LaTeX, Gephi)
Japanese (JLPT N1)
English (IELTS 6.5)
Work Experience
Data Engineering & Spatial Analysis Intern
- Conducted extensive literature analysis on geospatial data applications and predictive mobility modeling.
- Analyzed multi-dimensional geographic datasets, designing robust data pipelines for visualization and reporting.
- Engineered location-based features for commercial POIs by integrating urban, socioeconomic, and infrastructure data to build comprehensive geographic profiles.
- Evaluated spatial reasoning capabilities of Large Multimodal Models (LMMs) through prompt engineering, fusing satellite imagery with structured geographic contexts.
Tech Stack: Python, Pandas, GeoPandas, QGIS, LMM APIs (e.g., GPT-4V)
Teaching Assistant - Prescriptive Decision Analytics (MIS204)
Instructor: Asst. Prof. Liyi Gu
- Tutored students on core operations research topics, including linear programming, network models, inventory theory, and decision analysis.
- Assisted the instructor in grading assignments and quizzes, ensuring clear communication of complex mathematical concepts.
Research & Project Experience
Visiting Research Assistant - Social-Interaction-Aware LLM Agents
Advisor: Prof. Hiroki Hill Kobayashi
- Designed a tool-calling LLM agent architecture to retrieve historical trajectory data and generate plausible, individualized mobility sequences.
- Built a geospatial Data Converter tool utilizing point-in-polygon spatial joins to embed Japan level-3 administrative region descriptions into raw user trajectory coordinates.
- Developed "Preference Statistics" and "Trend Context" modules to extract long-term geographical preferences and short-term behavioral trends from hourly spatiotemporal data.
- Integrated GIS-based distance calculation tools to verify constraints and improve the spatial plausibility of the generated mobility sequences.
Tech Stack: Python, LangChain, GeoPandas, Shapely, LLM APIs
NLP Researcher - Entity Extraction & Collaboration Network Construction
Mentor: Asst. Prof. Shihui Feng
- Applied Large Language Models (LLMs) to extract entities and relationships from large-scale COVID-19-related news.
- Constructed organization-level collaboration networks and optimized data processing workflows.
Tech Stack: Python, Hugging Face, NetworkX, Gephi
Research Project - Concept Embedding and Author Network Analysis
Mentor: Asst. Prof. Yifang Ma
- Developed concept embedding methods to analyze academic domains and author performance.
- Explored relationships between author features and their academic impact using vector representations.
Tech Stack: Python, PyTorch, Word2Vec, NetworkX
Course Project - The Small-World Phenomenon under a Restricted Kleinberg Model
Mentor: Assoc. Prof. Yanging Hu
- Introduced directional constraints in Kleinberg's model to examine connectivity patterns in small-world networks. Co-authored course paper.
Tech Stack: Python, NetworkX, NumPy
Education
M.Sc. in Data Science for Sustainability
B.S. in Statistics | GPA: 3.49/4.00
Advisor: Asst. Prof. Yifang Ma
Relevant Coursework: Probability Theory, Stochastic Processes, Mathematical Statistics, Linear Algebra, Real Analysis, Network Science, Computing