Spaces

·

The AI App Directory

New Space Get PRO Learn more

Spaces of the week

20 Jul 2026

Bonsai 27B WebGPU Kernels

Run a 1-bit 27B LLM locally in your browser on WebGPU

Gemma 4 - Vision Token Budget

Resize images for visual token budgets while keeping aspect ratio

Nemotron-Labs-Audex

Unified audio-text intelligence

Krea 2 Identity Edit

Identity-preserving instruction image editing on Krea 2

Krea 2 Outpaint

Extend images into larger canvases with Krea 2 outpaint

OvisOCR2

Stream structured Markdown from document images and PDFs.

MOSS-VL-Realtime

Realtime VLM for image and video understanding

LTX-Best-Face-ID

Distilled LTX-2.3 identity video from a reference photo

All running apps, trending first

Bonsai 27B WebGPU Kernels

Run a 1-bit 27B LLM locally in your browser on WebGPU

Wan2.2 14B Fast Preview

generate a video from an image with a text prompt

Reproducing ICML 2026

Reproduce every ICML 2026 paper with your agent

HF Realtime Voice

Voice chat over WebSocket against a HF speech-to-speech

WANMAN

WAN2.2 based I2V

Krea 2 Identity Edit

Identity-preserving instruction image editing on Krea 2

Omni Image Editor

Image edit, text to image, image upscale, remove watermark

Qwen-Image-Edit-2511-LoRAs-Fast

Demo of the Collection of Qwen Image Edit LoRAs

Unlimited OCR

Extract text from images and PDFs instantly

Gemma Avatar

Talk to Gemma 4 face to face, with a 3D lip-synced avatar

Dream Motion Pro - Wan 2.2

Fast Wan 2.2 image-to-video with first/last frames

FLUX.2 Klein multi-LoRA

Use multiple FLUX.2-Klein LoRAs

Wan2.2 14B Fast Preview

generate a video from an image with a text prompt

OvisOCR2

Stream structured Markdown from document images and PDFs.

Krea 2 Outpaint

Extend images into larger canvases with Krea 2 outpaint

UniSE Speech Enhancement

Unified AR-LM-based speech enhancement & separation

LingBot-Video MoE 30B-A3B

Embodied MoE video generation — T2V, I2V, T2I

Wan2.2 14B Fast Preview

generate a video from an image with a text prompt

Pulpie

Clean HTML 20x faster — encoder vs decoder, live

Wan2.2 14B Preview

generate a video from an image with a text prompt

Z Image Turbo

Generate vivid images from text prompts in seconds

Hy3

Hy3 multi-turn streaming chat with function calling

TRELLIS.2

High-fidelity 3D Generation from images

Nemotron-Labs-Audex

Unified audio-text intelligence