My Experience

Knowledge gives us the tools to dream, and experience gives us the courage to achieve those dreams.

Work Experience

Company LogoQuid Inc.
Title: Machine Learning Engineering Intern
Mentor: Larrick Chen
Team: Discover Analysis
Dates: Dec. 2024 - Jun. 2025
Company LogoAcademia Sinica
Title: Research Assistant
Mentor: Prof. Li Su
Dates: Jun. 2024 - Aug. 2025
Company LogoCIeNET Technology
Title: Engineering Intern
Mentor: Jimmy Hsieh
Dates: Dec. 2023 - Jun. 2024
Company LogoNational Science and Technology Council (NSTC)
Title: Undergraduate Researcher
Dates: Sep. 2023 – Apr. 2024

Research Experience

Music and Audio Computing Lab, Academia Sinica
Dates: Jan. 2024 - Aug. 2025
Projects:
  • Proposed a novel end-to-end factorized codec learning framework for timbre/style transfer models with information perturbation and supervision, achieving enhanced timbre-content-ADSR disentanglement for controllable synthesizer preset conversion and surpassing state-of-the-art synthesizer timbre transfer baselines with a multi-resolution STFT loss from 5.69 to 2.22. [GitHub]
  • Developed an audio-query music source separation system using band-split Mamba2 with hypernetwork conditioning, enhancing timbre conditioning and boosting instrument-specific SNR by 7%.
Productivity Optimization Lab, NTU
Dates: Dec. 2023 - Dec. 2024
Projects:
  • Collaborated with a team of 6 to develop a GraphRAG-based news content analysis tool, leveraging LLMs for insight extraction from large datasets and reducing manual effort in social science research. [GitHub]
  • Applied NLP techniques such as LDA and NMF to analyze attitude shifts surrounding the 2021 Atlanta spa shootings.
Geospatial Computing Lab, NTU
Dates: Jan. 2022 - Jun. 2024
Projects:
  • [B.S. Thesis] Trip-purpose-based methods for predicting human mobility’s next location
  • Developed multimodal spatio-temporal models with a trip-purpose approach, achieving 80% accuracy, and designed a data pipeline to integrate mobility data, Google Maps polygons, and remote sensing datasets for spatial analysis.
  • Reached IMV contest semifinals by building a digital transaction platform with TypeScript (React), Node.js, and MongoDB, and developing a Selenium web crawler for real-time vegetable prices to optimize fertilizer use. [GitHub]
  • Simulated 3D crowd and vehicle flows in NetLogo and Python for safer Taipei Dome evacuations, informing exit planning.

Teaching Assistant Experience

Machine Learning, NTU
Dates: Jan. 2023 – Jan. 2024 (2 semesters)
Content:
  • Designed assignments & projects, and led TA sessions in English to support students with problem-solving and queries.
  • Conceived and led the final project, originating from my idea. Utilized Generative Adversarial Networks(GANs) to generate a noisier dataset based on the original, increasing task difficulty in a student final project, then applied machine learning models to establish baselines.
Computer Programming, NTU
Dates: Feb. 2022 – Jun. 2022
Content:
  • Led coding exercises, explained programming logic, and graded assignments & exams.

Open Review

NeurIPS AI Music 2025
Conference: NeurIPS Workshop on AI for Music
Dates: 2025
Description:
  • Reviewing submissions for the NeurIPS Workshop on AI for Music 2025.
ICASSP 2026
Conference: IEEE International Conference on Acoustics, Speech and Signal Processing
Dates: 2026
Description:
  • Reviewing submissions for ICASSP 2026.

Skills

C/C++

C/C++

Python

Python

Javascript

Javascript

Typescript

Typescript

C#

C#

Golang

Golang

HTML

HTML

CSS

CSS

PyTorch

PyTorch

Tensorflow

Tensorflow

HuggingFace

HuggingFace

OpenCV

OpenCV

Flask

Flask

React

React

Next.js

Next.js

NodeJS

NodeJS

MongoDB

MongoDB

MySQL

MySQL

PostgreSQL

PostgreSQL

GoogleAPI

GoogleAPI

Docker

Docker

Git

Git

NetLogo

NetLogo

Research Interest

  • Machine Learning
  • Representation Learning
  • Sound Separation
  • Text-to-Music Generation
  • Unsupervised Learning
  • Natural Language Processing