Open Source AI Finder
Discover the latest open-source models for your projects.
Signup For The AI Newsletter
Get the latest open-source AI models delivered regularly.
MVP4D
Computer VisionA high-resolution dataset and a method for estimating 4D scene flow (3D motion of points over time) from multi-view camera inputs.
D2E (Dialog-to-Event)
Natural Language ProcessingA dataset and baseline model for open-domain event extraction from conversations, aiming to identify and structure event information discussed in dialogues.
RTFM (Worldlabs AI)
Natural Language ProcessingA technique that allows a language model to answer questions about a large document by ingesting its entire content into the context, using a leave-one-out attention mechanism, without needing to fine-tune the model.
TAG
text-to-3dA model that generates realistic and controllable human actions within 3D scenes based on natural language descriptions.
PhysHSI
Image GenerationA physically-based rendering framework for synthesizing realistic hyperspectral images (HSI), which can be used as training data for other deep learning models.
Ring-1T
text-generationA 1-trillion parameter Chinese-English bilingual large language model, demonstrating strong capabilities in both languages.
UP2You
text-to-imageA training-free method to personalize text-to-image models, enabling the generation of images featuring specific subjects (like a person or pet) from just a few personal photos.
DeepSomatic
Scientific UnderstandingAn AI model based on DeepVariant technology that accurately identifies genetic variants in tumors (somatic variants) from DNA sequencing data.
DreamOmni2
image-to-3dA unified diffusion model that generates high-fidelity 3D objects and consistent multi-view images from either a single image or a text prompt.
StreamingVLM
multimodalAn efficient framework for Large Multimodal Models (LMMs) to process and understand long videos in a streaming fashion, maintaining high accuracy without needing to access the entire video at once.
