Skip to content

Commit

Permalink
add references
Browse files Browse the repository at this point in the history
  • Loading branch information
goyalpramod committed Feb 17, 2025
1 parent b51f9e5 commit d2be687
Showing 1 changed file with 32 additions and 12 deletions.
44 changes: 32 additions & 12 deletions _blogs/future_blogs_to_write.markdown
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@ image: [add image]
---
- Evolution of LLMs
- [Flash Attention Blog](https://gordicaleksa.medium.com/eli5-flash-attention-5c44017022ad)
- [Flash Attention 3](https://tridao.me/blog/2024/flash3/)
- [Mamba](https://tridao.me/blog/)
Expand All @@ -23,8 +24,8 @@ image: [add image]
- [GPT in 60 lines](https://jaykmody.com/blog/gpt-from-scratch)
- [A Brief History of Large Language Models](https://medium.com/@bradneysmith/98a1320e7650)
- CUDA & optimising GPUs
- [Understanding triton](https://isamu-website.medium.com/understanding-the-triton-tutorials-part-1-6191b59ba4c)
- [Triton Documentation](https://triton-lang.org/main/getting-started/tutorials/01-vector-add.html#sphx-glr-getting-started-tutorials-01-vector-add-py)
- [Reddit post on triton](https://www.reddit.com/r/OpenAI/comments/18nf310/openai_triton_coursetutorial_recommendations/)
Expand Down Expand Up @@ -57,8 +58,9 @@ image: [add image]
- [Performance and scalability by hf](https://huggingface.co/docs/transformers/v4.19.4/en/performance)
- [Optimization for llama](https://atscaleconference.com/videos/faster-than-fast-networking-and-communication-optimizations-for-llama-3/)
- [TorchTitan](https://github.com/pytorch/torchtitan)
- [Technical conferences on sharded training ](https://www.youtube.com/@scaleconference/videos)
- [Technical conferences on sharded training ](https://www.youtube.com/@scaleconference/videos)
- Guide for distributed training and training multiple GPUs
- [Distributed inference](https://huggingface.co/docs/diffusers/main/en/training/distributed_inference#model-sharding)
- [Meta blog on sharded training](https://engineering.fb.com/2021/07/15/open-source/fsdp/)
- [Making deep learning go brrrr](https://horace.io/brrr_intro.html)
Expand All @@ -67,26 +69,44 @@ image: [add image]
- A guide to hacking LLMs
- [Pliny the liberator](https://x.com/elder_plinius/highlights) -> I think I can only reverse engineer his tweets, no other option.
- [Blog on hacking LLMS](https://yourgpt.ai/blog/general/how-to-hack-large-language-models-llm)
- [SOme blog](https://www.siam.org/publications/siam-news/articles/how-to-exploit-large-language-models-for-good-or-bad/)
- [Another blog](https://www.comet.com/site/blog/prompt-hacking-of-large-language-models/)
- [A youtube video on the topic](https://www.youtube.com/watch?v=6bYGhY9HB8k)
- Building a 2B model from scratch
- pytorch docs, hf and umar jamil
- Building a vision model
- [HF blog on building vlm from scratch](https://huggingface.co/blog/AviSoori1x/seemore-vision-language-model)
- [Meta's intro to VLM modelling](https://ai.meta.com/research/publications/an-introduction-to-vision-language-modeling/)
- [Medium article on building a VLM](https://medium.com/@uthmant14/building-a-custom-vision-language-model-with-hugging-face-7d51099bfbcb)
- [Another one](https://medium.com/@achrafabbaoui/implementation-of-vision-language-models-vlm-from-scratch-a-comprehensive-technical-deep-dive-d348322f9b3c)
- [Paper on building a VLM](https://arxiv.org/abs/2408.12637)
- [Encode blog on VLMs](https://encord.com/blog/vision-language-models-guide/#:~:text=Building%20VLMs%20involves%20pre%2Dtraining,for%20more%20specific%20downstream%20tasks.)
- How do video gen models work
- [lil log blog on the topic](https://lilianweng.github.io/posts/2024-04-12-diffusion-video/)
-
- unsupervised learning and RL, the dark horse of ML
- [RL training of gaming agents](https://diamond-wm.github.io/)
- An MLE's guide to WEB DEVELOPMENT
IDK read to figure out
- [The paradox of diffusion distillation](https://sander.ai/2024/02/28/paradox.html) -> more blogs on diffusion by this guy
- [Good blogs, some may be useful](https://mccormickml.com/)
- [Timeline for AI](https://nhlocal.github.io/AiTimeline/#2024)
- An MLE's guide to WEB DEVELOPMENT
IDK read to figure out
- [The paradox of diffusion distillation](https://sander.ai/2024/02/28/paradox.html) -> more blogs on diffusion by this guy
- [Good blogs, some may be useful](https://mccormickml.com/)
- [Timeline for AI](https://nhlocal.github.io/AiTimeline/#2024)
- [All of sebastians blogs](https://magazine.sebastianraschka.com/archive?sort=top)
- [idk](https://jakevdp.github.io/blog/2013/04/15/code-golf-in-python-sudoku/)
- [Build your own x](https://github.com/codecrafters-io/build-your-own-x)
- [Neetcode](https://neetcode.io/)
-
- -->
- [Umar's videos on various topics](https://www.youtube.com/@umarjamilai/videos)
- -->

0 comments on commit d2be687

Please sign in to comment.