🎯
Focusing
Multimodal Learning and NLP | Visiting PhD in MIT |
PhD in THUNLP
-
MIT
- Boston
- https://jameshujy.github.io/
Pinned Loading
-
OpenBMB/VisCPM
OpenBMB/VisCPM Public[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
-
thunlp/ACDiT
thunlp/ACDiT PublicACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer
-
OpenVLG/DELLA
OpenVLG/DELLA PublicOfficial code for the NAACL 2022 paper "Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation"
-
huggingface/open-r1
huggingface/open-r1 PublicFully open reproduction of DeepSeek-R1
-
Deep-Agent/R1-V
Deep-Agent/R1-V PublicWitness the aha moment of VLM with less than $3.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.