🙋‍♀️ About me

Hi! I’m Evelyn, a Master’s student in the Machine Learning and Data Science (MLDS) program at Northwestern University, previously graduating from UC San Diego with dual degree in Data Science and Mathematics & Economics. I’m passionate about leveraging data analytics and domain expertise to build AI solutions that are thoughtful, responsible, and human-centered.

I am seeking a 2026 Summer Internship as a Data Scientist, Machine Learning Engineer, Data Analyst, or Business Intelligence Engineer. Driven by curiosity and a commitment to lifelong learning, I continuously explore new technologies, frameworks, and methodologies to stay at the forefront of AI and machine learning innovation.

Please feel free to reach out at xinqihuang2026@u.northwestern.edu if my experience and skill set align with opportunities you’d like to discuss.

When I’m not deep in data, you can find me enjoying a good movie, watching sunsets wherever I am (La Jolla Shores is still my favorite!), or giving myself a fresh manicure. I’m all about finding joy in the little things and keeping life vibrant and fun! ✨

📖 Educations

  • 2025.09 - 2026.12: Machine Learning and Data Science (MLDS), Northwestern University
  • 2021.09 - 2025.06: Bachelor of Science in Data Science and Joint Mathematics–Economics, UC San Diego

💻📝 Internships and Experiences

  • 2025.09 - Present: Data Scientist (Industry Practicum) at Mintel
  • 2024.07 - 2024.09: Data Science Intern at POIZON
  • 2023.07 - 2023.09 : Business Analyst Intern at Arthur D. Little

🔬 Research and Publications

1. Do Vision-Language Models Have an Internal World Model? Towards an Atomic Evaluation

Qiyue Gao*, Xinyu Pi*, Kevin Liu, Junrong Chen, Ruolan Yang, Xinqi Huang, Xinyu Fang, Lu Sun, Gautham Kishore, Bo Ai, Stone Tao, Mengyang Liu, Jiaxi Yang, Chao-Jung Lai, Chuanyang Jin, Jiannan Xiang, Benhao Huang, Zeming Chen, David Danks, Hao Su, Tianmin Shu, Ziqiao Ma, Lianhui Qin, Zhiting Hu

ACL 2025 Findings

arXiv:2506.21876

Website

2. PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

PAN Team, Institute of Foundation Models, as contributor to the data and evaluation section

paper

Pan World Model Website

3. World Reasoning Arena

PAN Team Institute of Foundation Models: Qiyue Gao*, Kun Zhou*, Jiannan Xiang*, Zihan Liu*, Dequan Yang, Junrong Chen, Arif Ahmad, Cong Zeng, Ganesh Bannur, Xinqi Huang, Zheqi Liu, Yi Gu, Yichi Yang, Guangyi Liu, Zhiting Hu, Zhengzhong Liu, Eric Xing

paper

🧠 Projects

Project Image

VLM Benchmark: WM-ABench

Using Maniskill simualtor, we generated 37K+ image-based Q&A scenarios for benchmarking 11 commercial and open-source Vision-Language Models (including GPT, Gemini, Claude, and Qwen) perception and prediction capability for downstream robotic manipulation tasks.

link to video demo

link to dataset

link to code

Project Image

ETL and Data Visualization: Ebird spot map

Explore real-time bird migration through an automated data pipeline! Using ebird api, Apache Airflow, PostgreSQL, Docker, SQL, HTML, Streamlit and Prediction Modeling, we built an end-to-end system delivering up-to-date bird location insights with role-specific visualizations and a reliable two-stage deployment.

link to github

Streamlit app demo

Demo map

Project Image

Causal Discovery and Deep learning: Ensemble Deep Learning for Stock Return Prediction in Volatile Markets

Financial market volatility, driven by economic conditions, corporate shocks, and investor behavior, makes stock return forecasting highly challenging.

Traditional methods, such as macroeconomic analysis and sentiment analysis, have limitations in capturing the complex relationships between stock returns and external factors.

Our goal is to develop an advanced stock return prediction framework that leverages causal discovery algorithms to enhance interpretability and accuracy.

link to github

link to report

link to poster

Project Image

Storytelling Visualization: Campaigning on Twitter

Explore the “word war” of the 2016 election through this storytelling project! Using NLP, sentiment analysis, and interactive visualizations, we uncovered how candidates turned Twitter into a campaign battlefield, shaping the outcome one tweet at a time.

A Demo

Project Image

Predictive task: Advancing Multimodal Sentiment Analysis through Enhanced Sarcasm Detection

Sarcasm Hard to Catch? Not Anymore! This project combines text and audio to detect sarcasm, making sentiment analysis easier. We thoughtfully designed every step, from data collection and feature engineering to model building. With our sarcasm detection model, we’ve cracked the code of complex emotions. Because sometimes, tone says it all!

Project Image

Casual Discovery: Algorithm Comparison on Simulated Mental Health and Remote Work data

Dive into this project to discover how well different causal algorithms uncover relationships between variables! Explore how a person’s work enviroment, daily habits and mental health connect in our simulated dataset.

Project Image

Data Minning: Used Car Price Prediction

Ever wondered what drives the price of a second-hand car? We built an XGB Regressor that beat baseline models by over 50% in RMSE. From mileage to color quirks, we unraveled the secrets behind used car pricing with data minning! 🚗✨

Project Image

Storytelling Visualization: Pattern of US City Names

Discover how US cities got their names from around the world! Explore migration and immigration patterns through the checkboxes, hover to the points for details, and read the story to see how history shaped these connections.

Explore more projects in my GitHub page! Dive in and join me as I continue creating, learning, and growing! 🚀✨

👀 Something else