add references

goyalpramod · Feb 17, 2025 · d2be687 · d2be687
1 parent b51f9e5
commit d2be687
Showing 1 changed file with 32 additions and 12 deletions.
diff --git a/_blogs/future_blogs_to_write.markdown b/_blogs/future_blogs_to_write.markdown
@@ -7,6 +7,7 @@ image: [add image]
 ---
 
 - Evolution of LLMs
+
   - [Flash Attention Blog](https://gordicaleksa.medium.com/eli5-flash-attention-5c44017022ad)
   - [Flash Attention 3](https://tridao.me/blog/2024/flash3/)
   - [Mamba](https://tridao.me/blog/)
@@ -23,8 +24,8 @@ image: [add image]
   - [GPT in 60 lines](https://jaykmody.com/blog/gpt-from-scratch)
   - [A Brief History of Large Language Models](https://medium.com/@bradneysmith/98a1320e7650)
 
-
 - CUDA & optimising GPUs
+
   - [Understanding triton](https://isamu-website.medium.com/understanding-the-triton-tutorials-part-1-6191b59ba4c)
   - [Triton Documentation](https://triton-lang.org/main/getting-started/tutorials/01-vector-add.html#sphx-glr-getting-started-tutorials-01-vector-add-py)
   - [Reddit post on triton](https://www.reddit.com/r/OpenAI/comments/18nf310/openai_triton_coursetutorial_recommendations/)
@@ -57,8 +58,9 @@ image: [add image]
   - [Performance and scalability by hf](https://huggingface.co/docs/transformers/v4.19.4/en/performance)
   - [Optimization for llama](https://atscaleconference.com/videos/faster-than-fast-networking-and-communication-optimizations-for-llama-3/)
   - [TorchTitan](https://github.com/pytorch/torchtitan)
-  - [Technical conferences on sharded training ](https://www.youtube.com/@scaleconference/videos) 
+  - [Technical conferences on sharded training ](https://www.youtube.com/@scaleconference/videos)
 - Guide for distributed training and training multiple GPUs
+
   - [Distributed inference](https://huggingface.co/docs/diffusers/main/en/training/distributed_inference#model-sharding)
   - [Meta blog on sharded training](https://engineering.fb.com/2021/07/15/open-source/fsdp/)
   - [Making deep learning go brrrr](https://horace.io/brrr_intro.html)
@@ -67,26 +69,44 @@ image: [add image]
 
 - A guide to hacking LLMs
 
+  - [Pliny the liberator](https://x.com/elder_plinius/highlights) -> I think I can only reverse engineer his tweets, no other option.
+  - [Blog on hacking LLMS](https://yourgpt.ai/blog/general/how-to-hack-large-language-models-llm)
+  - [SOme blog](https://www.siam.org/publications/siam-news/articles/how-to-exploit-large-language-models-for-good-or-bad/)
+  - [Another blog](https://www.comet.com/site/blog/prompt-hacking-of-large-language-models/)
+  - [A youtube video on the topic](https://www.youtube.com/watch?v=6bYGhY9HB8k)
+
 - Building a 2B model from scratch
 
+  - pytorch docs, hf and umar jamil
+
 - Building a vision model
 
+  - [HF blog on building vlm from scratch](https://huggingface.co/blog/AviSoori1x/seemore-vision-language-model)
+  - [Meta's intro to VLM modelling](https://ai.meta.com/research/publications/an-introduction-to-vision-language-modeling/)
+  - [Medium article on building a VLM](https://medium.com/@uthmant14/building-a-custom-vision-language-model-with-hugging-face-7d51099bfbcb)
+  - [Another one](https://medium.com/@achrafabbaoui/implementation-of-vision-language-models-vlm-from-scratch-a-comprehensive-technical-deep-dive-d348322f9b3c)
+  - [Paper on building a VLM](https://arxiv.org/abs/2408.12637)
+  - [Encode blog on VLMs](https://encord.com/blog/vision-language-models-guide/#:~:text=Building%20VLMs%20involves%20pre%2Dtraining,for%20more%20specific%20downstream%20tasks.)
+
 - How do video gen models work
 
+  - [lil log blog on the topic](https://lilianweng.github.io/posts/2024-04-12-diffusion-video/)
+  -
+
 - unsupervised learning and RL, the dark horse of ML
+
   - [RL training of gaming agents](https://diamond-wm.github.io/)
 
-- An MLE's guide to WEB DEVELOPMENT 
-  
-  
-  
-IDK read to figure out 
- - [The paradox of diffusion distillation](https://sander.ai/2024/02/28/paradox.html) -> more blogs on diffusion by this guy
- - [Good blogs, some may be useful](https://mccormickml.com/)
-  - [Timeline for AI](https://nhlocal.github.io/AiTimeline/#2024) 
+- An MLE's guide to WEB DEVELOPMENT
+
+IDK read to figure out
+
+- [The paradox of diffusion distillation](https://sander.ai/2024/02/28/paradox.html) -> more blogs on diffusion by this guy
+- [Good blogs, some may be useful](https://mccormickml.com/)
+- [Timeline for AI](https://nhlocal.github.io/AiTimeline/#2024)
 - [All of sebastians blogs](https://magazine.sebastianraschka.com/archive?sort=top)
 - [idk](https://jakevdp.github.io/blog/2013/04/15/code-golf-in-python-sudoku/)
 - [Build your own x](https://github.com/codecrafters-io/build-your-own-x)
 - [Neetcode](https://neetcode.io/)
-- 
-- -->
+- [Umar's videos on various topics](https://www.youtube.com/@umarjamilai/videos)
+- -->