-
Flow-Matching Objectives
In the previous blog, I walked through the simulation-based approaches to training neural ODE/continuous normalizing flow models. Those approaches are mathematically elegant, but they remain expensive and hard to scale in practice. Flow-matching objectives are training targets designed to make the training affordable and scalable. In this blog, I will review the derivations behind flow-matching models.
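As a quick preview (my notation here, details in the post): the conditional flow-matching objective regresses a learned vector field onto a tractable per-sample target field, with no ODE simulation during training,

```latex
\mathcal{L}_{\mathrm{CFM}}(\theta)
  = \mathbb{E}_{t \sim \mathcal{U}[0,1],\; x_1 \sim q(x_1),\; x \sim p_t(x \mid x_1)}
    \left\| v_\theta(t, x) - u_t(x \mid x_1) \right\|^2
```

where $p_t(x \mid x_1)$ is a conditional probability path ending at the data sample $x_1$ and $u_t(x \mid x_1)$ is the vector field that generates it.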
-
Training Neural ODE with three different loss types
The recently popular flow-matching models build on another interesting family of models: the neural ODE/continuous normalizing flow. While the main idea behind flow-matching models is to find a practical and affordable way to train neural ODEs, the original adjoint sensitivity method is intellectually interesting in its own right and full of meaningful details. So, in this blog, I'll review the derivations behind the adjoint method before diving into the flow-matching objective in the next one. In the end, both are good candidate protocols for making observables from MD trajectories differentiable.
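For reference, the core of the adjoint sensitivity method (following Chen et al., 2018) defines the adjoint state $a(t) = \partial L / \partial z(t)$ and integrates it backward in time alongside the original dynamics $\dot{z} = f(z, t, \theta)$:

```latex
\frac{\mathrm{d}a(t)}{\mathrm{d}t}
  = -\, a(t)^{\top} \frac{\partial f(z(t), t, \theta)}{\partial z},
\qquad
\frac{\mathrm{d}L}{\mathrm{d}\theta}
  = -\int_{t_1}^{t_0} a(t)^{\top} \frac{\partial f(z(t), t, \theta)}{\partial \theta}\, \mathrm{d}t
```

so the memory cost is constant in the number of solver steps, at the price of a second backward-in-time solve.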
-
Implicit Reparameterization Gradients
This note delves into a paper recommended by Kevin, which focuses on the challenges of obtaining low-variance gradients for continuous random variables, particularly those pesky distributions we often encounter (yes, the Rice distribution). Key takeaway: you can build unbiased pathwise gradient estimators for continuous distributions with numerically tractable CDFs, such as the gamma, truncated distributions, and mixtures.
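For a concrete illustration (my own example, not from the paper's code): PyTorch's `torch.distributions.Gamma` implements this kind of implicit reparameterization, so `rsample()` gives pathwise gradients even though the gamma has no simple location-scale transform:

```python
import torch

# Gamma has no location-scale reparameterization, yet rsample()
# still provides a pathwise gradient, obtained by implicitly
# differentiating through the (tractable) CDF.
concentration = torch.tensor(2.0, requires_grad=True)
dist = torch.distributions.Gamma(concentration, rate=torch.tensor(1.0))

x = dist.rsample()         # differentiable sample
x.backward()               # gradient flows back to `concentration`
print(concentration.grad)  # a finite pathwise gradient, not None
```

The same `rsample()` interface works for any distribution with `has_rsample=True`, which is what makes these estimators drop-in replacements for score-function gradients.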
-
An obscure cause of GPU memory leaks in PyTorch
A short debugging note on why I kept getting the "CUDA out of memory" error in my code. The main takeaway: don't use in-place operations in your compute graph unless necessary, and if you are applying them to non-leaf tensors, change them even if they seem necessary. I tested on both PyTorch 1.13 and 2.0, with CUDA versions 11.6 and 11.7.
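A minimal CPU-only sketch (my own repro, not the exact code from the debugging session) of why autograd dislikes in-place operations on non-leaf tensors:

```python
import torch

x = torch.randn(3, requires_grad=True)
y = torch.sigmoid(x)  # non-leaf tensor; sigmoid saves its output for backward
y += 1                # in-place op bumps y's version counter

caught = None
try:
    y.sum().backward()
except RuntimeError as err:
    # autograd detects that a tensor it saved for backward
    # was modified in place, and refuses to compute gradients
    caught = err
print("autograd complained:", caught is not None)
```

When autograd does not catch the mutation, the same pattern can instead silently keep extra buffers alive, which is how it shows up as creeping GPU memory use rather than an immediate error.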
-
Configure A macOS with M1 chip From Scratch
A walk-through note on configuring my familiar working setup on a brand-new macOS system with an M1 chip, including the Git token, Homebrew, the Terminal color theme, Oh My Zsh plugins, and conda. Compared to the previous post for an Intel chip, the difference lies mainly in the Homebrew PATH. I also use Mambaforge instead of Miniconda for Python environment management.
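For reference, Homebrew on Apple Silicon installs under `/opt/homebrew` (Intel used `/usr/local`), so the login shell needs its environment hooked in, typically via `~/.zprofile`:

```shell
# Homebrew on Apple Silicon lives in /opt/homebrew; add its
# shellenv (PATH, MANPATH, etc.) to the zsh login config:
echo 'eval "$(/opt/homebrew/bin/brew shellenv)"' >> ~/.zprofile
eval "$(/opt/homebrew/bin/brew shellenv)"
```

This is the line the Homebrew installer itself prints at the end of a fresh Apple Silicon install.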