🙋♀️ About me
Hi! I’m Evelyn, a Master’s student in Machine Learning and Data Science at Northwestern University, previously graduated from UC San Diego with degrees in Data Science and Joint Mathematics and Economics. I’m passionate about combining data analytics with my domain knowledge to craft AI-powered solutions that reflect my commitment to thoughtfulness, responsibility, and humanity.
When I’m not deep in data, you can find me enjoying a good movie, catching a stunning sunset at La Jolla Shores (it’s the best!), or making myself to a fresh manicure. I’m all about finding joy in the little things and keeping life vibrant and fun! ✨
📖 Educations
- 2025.08 - 2026.12: MLDS, Northwestern University
- 2019.09 - 2025.06: Bachelor of Science, University of California, San Diego
💻📝 Internships
- 2024.08 - 2024.09: Data Analyst Intern at POIZON
- 2023.08 - 2023.10 : Business Analyst Intern at Arthur D. Little
🔬 Research
- 2024.04 - present: Undergraduate Researcher at MixLab @ UCSD HDSI
Do Vision-Language Models Have an Internal World Model? Towards an Atomic Evaluation
Qiyue Gao*, Xinyu Pi*, Kevin Liu, Junrong Chen, Ruolan Yang, Xinqi Huang, Xinyu Fang, Lu Sun, et al.
ACL 2025
- 2024.01 - 2024.04 : Undergraduate Researcher at The Economic Research Lab
🧠 Projects
Financial market volatility, driven by economic conditions, corporate shocks, and investor behavior, makes stock return forecasting highly challenging.
Traditional methods, such as macroeconomic analysis and sentiment analysis, have limitations in capturing the complex relationships between stock returns and external factors.
Our goal is to develop an advanced stock return prediction framework that leverages causal discovery algorithms to enhance interpretability and accuracy.
Storytelling Visualization: Campaigning on Twitter
Explore the “word war” of the 2016 election through this storytelling project! Using NLP, sentiment analysis, and interactive visualizations, we uncovered how candidates turned Twitter into a campaign battlefield, shaping the outcome one tweet at a time.
Predictive task: Advancing Multimodal Sentiment Analysis through Enhanced Sarcasm Detection
Sarcasm Hard to Catch? Not Anymore! This project combines text and audio to detect sarcasm, making sentiment analysis easier. We thoughtfully designed every step, from data collection and feature engineering to model building. With our sarcasm detection model, we’ve cracked the code of complex emotions. Because sometimes, tone says it all!
Casual Discovery: Algorithm Comparison on Simulated Mental Health and Remote Work data
Dive into this project to discover how well different causal algorithms uncover relationships between variables! Explore how a person’s work enviroment, daily habits and mental health connect in our simulated dataset.
Data Minning: Used Car Price Prediction
Ever wondered what drives the price of a second-hand car? We built an XGB Regressor that beat baseline models by over 50% in RMSE. From mileage to color quirks, we unraveled the secrets behind used car pricing with data minning! 🚗✨
Storytelling Visualization: Pattern of US City Names
Discover how US cities got their names from around the world! Explore migration and immigration patterns through the checkboxes, hover to the points for details, and read the story to see how history shaped these connections.
Explore more projects in my GitHub page! Dive in and join me as I continue creating, learning, and growing! 🚀✨
👀 Something else