-
The Met
- Brooklyn, NY
- http://www.robotconscience.com
Lists (8)
Sort Name ascending (A-Z)
Stars
Production-grade 3D gaussian splatting with CPU/GPU support for Windows, Mac and Linux 🚀
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
State-of-the-art CLIP/SigLIP embedding models finetuned for the fashion domain. +57% increase in evaluation metrics vs FashionCLIP 2.0.
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
A camera control for three.js, similar to THREE.OrbitControls yet supports smooth transitions and more features.
A comparison of different advanced prompt engineering techniques as applied to image descriptions of artworks for the museum field
🧩 Extend Three.js standard materials with your own shaders!
A library + API spec for easily streaming generative AI output to your chat applications.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Super simple MLX (apple silicon) CLIP based photo similarity web app
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Computational photography pipeline that performs multiple inferences from any image or video.
CVPR 2024: The official implementation of HumanNorm
Node.js samples for Google Cloud Platform products.
Easily display interactive 3D models on the web and in AR!
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
A collection of USD fileformat plugins
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
⏹ Make VR user interfaces for Three.js
OCR, layout analysis, reading order, table recognition in 90+ languages
Blender glTF 2.0 importer and exporter
World tracking for WebAR. A Javascript library for Augmented Reality to run SLAM in the browser.