Spaces

·

The AI App Directory

New Space Get PRO Learn more

Omnilingual ASR Media Transcription

Transcribe audio or video into text in multiple languages

NeuTTS-Air

Generate speech using reference audio and text

AI Video Enhancer 4K

Enhance and upscale short videos using AI

Wan2.2 14B Fast Preview

generate a video from an image with a text prompt

Echo-TTS Preview

Fast, multi-speaker TTS (44.1kHz) with voice cloning

HighQualityPython

Generate higher quality Python with a clened DeepSeek-Coder.

Z-Image Turbo (ZIT) Controlnet

Edit and guide image generation

Number Recognizer

Door number recognition

Pianos

Piano Sound Quality Classifier

HEp2

HEp2 cell image classifier

PyTorch CV Backbones

Retrieve image model information and generate JSON data

Bel Canto Discriminator

Discriminator of Bel Canto and Chinese Folk Singing

Web Tools

Online programming aids

MassivelyMultilingualTTS

Generate speech from text in multiple languages

Whisper WebGPU

Convert spoken words into text

Erhu Playing Tech

Erhu Performance Technique Recognizer

Joy Caption Alpha Two

Generate captions for images in various styles

Figured Bass Calculator

Convert figured bass to chord

Image Face Upscale Restoration-GFPGAN-RestoreFormerPlusPlus-CodeFormer

Enhance and upscale images with advanced models

Guzheng Tech99

Frame-level guzheng playing technique detector

Pentatonic Mode

Chinese Music Pentatonic Mode Detector

Chinese Instruments

Chinese Traditional Instrument Sound Retriever

Guzheng Playing Tech

Guzheng Performance Technique Recognizer

SMTP Tester

Test Simple Mail Transfer Protocol (SMTP)