- bengaluru
-
04:04
- 12h behind - https://www.youtube.com/channel/UC3aFT0jWh0sCtzlNE_1gqRg
Stars
Run Ollama LLM models in Google Colab for free
Easy and fast 2d human and animal multi pose estimation using SOTA ViTPose [Y. Xu et al., 2022] Real-time performances and multiple skeletons supported.
This repo implements and trains Vision Transformer (VIT) on a synthetically generated dataset which has colored mnist images on texture backgrounds
A nice 3D avatar that can speak input text with facial expressions
Python sample codes and documents about Autonomous vehicle control algorithm. This project can be used as a technical guide book to study the algorithms and the software architectures for beginners.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
The Tensorflow, Keras implementation of Swin-Transformer and Swin-UNET
This is the code for "How to Make a Prediction - Intro to Deep Learning #1' by Siraj Raval on YouTube
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Check your ranking in GitHub! Don't forget to star ⭐ this repository.
Using AI to Discover New Cancer Treatments (in silico)
Depth Any Video with Scalable Synthetic Data (ICLR 2025)
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment
Solving the Traveling Salesman Problem using Self-Organizing Maps
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
[ICLR2024] the official pytorch implementation of UC-NeRF
A driving dataset for the development and validation of fused pose estimators and mapping algorithms
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Open-source and strong foundation image recognition models.
[ICCV21 & WACV23] Monocular 3D Object Detection for Automonous Driving
Code for "Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks", Gupta et al, CVPR 2018
Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Library.