Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
hallucinations-leaderboard
community
https://www.neuralnoise.com
pminervini
pminervini
Activity Feed
Request to join this org
Follow
17
AI & ML interests
None defined yet.
Recent Activity
pminervini
authored
a paper
about 20 hours ago
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
pminervini
authored
a paper
about 23 hours ago
SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks
aryopg
authored
a paper
about 23 hours ago
SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks
View all activity
Team members
10
spaces
1
pinned
Runtime error
Agents
145
Hallucinations Leaderboard
🔥
View and submit LLM evaluations
models
0
None public yet
datasets
2
Sort: Recently updated
hallucinations-leaderboard/requests
Preview
•
Updated
Oct 31, 2024
•
323
hallucinations-leaderboard/results
Updated
Oct 31, 2024
•
28.9k
•
2