cv
This is a web layout of my resumé. You can also download my resumé with the PDF button. ↑
Basics
Name | Zerui "Jerry" Ma |
Label | Prospect Ph.D. Student |
jerryma@smu.edu | |
Phone | (747) 289-8602 |
Url | https://jerma88.github.io/ |
Summary | Currently finishing a Bachelor in CS, Math and Data Science. Interested in academic research. Click the PDF icon to download my resume. |
Interests
Machine Learning | |
Neural Networks | |
Natural Language Processing | |
Transformer-based Models | |
Large Language Models | |
Computational Linguistics | |
Parallel Programming | |
Theoretical Machine Learning | |
Machine Learning with Medical Data | |
Deep Learning | |
Computer Vision | |
Quantum Computing |
Education
-
Aug 2022 - May 2025 Dallas, Texas
Research experience
Fellowship grants
Teaching experience
Projects
- Dec 2023 - Ongoing
Personality LLM Research
Using PyTorch and Transformer to fine-tune a custom LLM to predict Big-Five Personality Traits. Involved in preprocessing real-life interview data and fine-tuning BERT and Llama models.
- Extracted fine-tuned roBERTa embeddings for Recurrent Neural Networks training.
- Ran batch jobs on HPC and managed complex data structures using Ubuntu Linux commands.
- June 2024 - Ongoing
Individual Research - Recommender Systems
Conducting research as part of the Robert Mayer Undergraduate Research Fellowship Program, building a recommendation system using Set-Cover Algorithms to improve university curriculum advising.
- Built a relational database with web-scraped data.
- Developed a system for retrieving accurate course recommendations based on student history.
- July 2024 - July 2024
Governor's Champion Summer Camp
Taught a course for high school students on AI and machine learning, covering topics such as intelligent agents, data preprocessing, and algorithms.
- Developed the curriculum and coordinated with colleagues to deliver engaging hands-on projects.
- June 2024 - Aug 2024
RAG Application Full Stack Development
Developed a medical transcript summarization tool using LangChain and Python. Hosted a vector database to enhance LLM-based medical diagnosis using GPT-4.
- Increased medical diagnosis efficiency by streamlining review of patient histories.
Skills
Programming | |
Python | |
C++ | |
Java | |
Shell Script | |
SQL | |
STM/MIPS ASM | |
JS/HTML/CSS | |
Swift |
AI/ML | |
PyTorch | |
scikit-learn | |
HPC | |
batch job | |
Dask | |
YOLO | |
DDP | |
VIM |
DevOps | |
Git/GitHub | |
REST API | |
Jekyll | |
GNU | |
Valgrind | |
Docker | |
Jupyter | |
Azure |
Operating Systems & Other | |
Linux (Arch, Ubuntu) | |
Windows 7-11 | |
MacOS | |
LaTeX | |
Markdown | |
R | |
Web Scraping | |
Microsoft Office |
Work experience
Languages
Chinese | |
Native speaker |
English | |
Fluent |
Spanish | |
Conversational |