multimodal meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 138k • 1.6k
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 138k • 1.6k
audio-collection Build error Agents 33 Parakeet-tdt_ctc-1.1b 🦜 33 Transcribe audio with timestamps
multimodal meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 138k • 1.6k
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 138k • 1.6k
audio-collection Build error Agents 33 Parakeet-tdt_ctc-1.1b 🦜 33 Transcribe audio with timestamps