---
title: "Efficient and Scalable Distributed LLM Training: Hiding Communication Overhead"

event: Weekly Talk
event_url:

location: COM3-B1-15 - Meeting Rm 92
address:
  street:
  city:
  region:
  postcode:
  country: Singapore

summary:
abstract: "Training Large Language Models (LLMs) is often inefficient due to high communication overhead, resulting in sub-50% Model FLOPS Utilization (MFU). In this talk, I will discuss how to build a cost-efficient and scalable machine learning system, using DHelix as an example. Inspired by the DNA double-helix structure, DHelix improves efficiency through Strand Interleaving (SI), which overlaps forward and backward passes to maximize computation-communication concurrency. It integrates seamlessly with all parallelism strategies, including pipeline parallelism via a model folding design.

Experiments on Llama, GPT, and Phi MoE models across A40, A800, and H100 clusters demonstrate up to 58% MFU on A40 and 71% on A800, significantly outperforming state-of-the-art methods. I will explore DHelix’s design, optimization techniques, and its broader impact on distributed LLM training."

# Talk start and end times.
# End time can optionally be hidden by prefixing the line with `#`.
date: "2025-02-12T14:00:00Z"
date_end: "2025-02-12T15:00:00Z"
all_day: false

# Schedule page publish date (NOT talk date).
publishDate: "2017-01-01T00:00:00Z"

authors: [Chaoyi Ruan]
tags: [Weekly Talk]

# Is this a featured talk? (true/false)
featured: false

image:
  caption: 'Image credit: [**Unsplash**](https://unsplash.com/photos/bzdhc5b3Bxs)'
  focal_point: Right

url_code: ""
url_pdf: ""
url_slides: ""
url_video: ""

# Markdown Slides (optional).
# Associate this talk with Markdown slides.
# Simply enter your slide deck's filename without extension.
# E.g. `slides = "example-slides"` references `content/slides/example-slides.md`.
# Otherwise, set `slides = ""`.
slides:

# Projects (optional).
# Associate this post with one or more of your projects.
# Simply enter your project's folder or file name without extension.
# E.g. `projects = ["internal-project"]` references `content/project/deep-learning/index.md`.
# Otherwise, set `projects = []`.
projects:

# Slides can be added in a few ways:
#
# - **Create** slides using Wowchemy's [*Slides*](https://wowchemy.com/docs/managing-content/#create-slides) feature and link using `slides` parameter in the front matter of the talk file
# - **Upload** an existing slide deck to `static/` and link using `url_slides` parameter in the front matter of the talk file
# - **Embed** your slides (e.g. Google Slides) or presentation video on this page using [shortcodes](https://wowchemy.com/docs/writing-markdown-latex/).
#
# Further event details, including page elements such as image galleries, can be added to the body of this page.


---
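The core idea behind Strand Interleaving, as described in the abstract, is to run one strand's communication while the other strand computes, so neither the compute units nor the interconnect sit idle. The toy sketch below is not DHelix's implementation (which operates on real GPU kernels and collectives); it only illustrates the overlap pattern using two concurrent tasks, with `compute` and `communicate` as hypothetical stand-ins for a GPU kernel and a collective operation.

```python
# Illustrative sketch only: `compute` and `communicate` are hypothetical
# stand-ins for a GPU compute kernel and a communication collective
# (e.g. an all-gather); they are not part of DHelix's actual API.
from concurrent.futures import ThreadPoolExecutor
import time


def compute(strand: str, step: int) -> str:
    time.sleep(0.01)  # placeholder for compute work
    return f"{strand}-compute-{step}"


def communicate(strand: str, step: int) -> str:
    time.sleep(0.01)  # placeholder for a communication collective
    return f"{strand}-comm-{step}"


def interleaved(steps: int) -> list[str]:
    """Overlap strand A's communication with strand B's computation.

    Each step launches both tasks concurrently, so the communication
    latency is hidden behind the other strand's computation.
    """
    log = []
    with ThreadPoolExecutor(max_workers=2) as pool:
        for step in range(steps):
            fa = pool.submit(communicate, "A", step)  # A communicates...
            fb = pool.submit(compute, "B", step)      # ...while B computes
            log.append(fa.result())
            log.append(fb.result())
    return log


print(interleaved(2))
```

In a real system the interleaving schedule must also respect data dependencies between a strand's own compute and communication phases; this sketch ignores that and only shows the concurrency structure.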