Public Datasets

Explore VNOVA AI datasets hosted on Hugging Face. Use them for research, RAG, fine-tuning, or safety experiments.

INDIA_FRAUD_DETECTION_JSONL_V1
100 synthetic fraud & scam scenarios — JSONL — CC-BY-4.0
View on Hugging Face
LEGAL_ASSISTANT_DATASET_JSONL_INDIA
60 synthetic legal & crime-safety scenarios — JSONL — CC-BY-4.0
View on Hugging Face
EMOTIONAL_SUPPORT_DATASET_V1
100 synthetic emotional support — JSONL — CC-BY-4.0
View on Hugging Face
CYBERSECURITY_DATASET_V1
100 synthetic cybersecurity scenarios — JSONL — CC-BY-4.0
View on Hugging Face
DECISION_MAKING_ASSISTANT_DATASET_V1_JSONL
100 synthetic decision making scenarios — JSONL — CC-BY-4.0
View on Hugging Face
CUSTOMER_SUPPORT_DATASET_JSONL_V1
100 synthetic customer support scenarios — JSONL — CC-BY-4.0
View on Hugging Face
EMERGENCY_DISASTER_RESPONSE_V1_JSONL
100 synthetic customer support scenarios — JSONL — CC-BY-4.0
View on Hugging Face
STARTUP_STRATEGY_DATASET_JSONL_V1
100 synthetic Startup Strategy scenarios — JSONL — CC-BY-4.0
View on Hugging Face
VNOVA_AI_CODING_LOGIC_TUTOR_DATASET_V1_JSONL
100 synthetic Coding Logic scenarios — JSONL — CC-BY-4.0
View on Hugging Face
AI_Creativity_Booster_Dataset_V1_JSONL
100 synthetic AI Creativity Booster scenarios — JSONL — CC-BY-4.0
View on Hugging Face
Emerging_AI-First_Industries_V1_JSONL
100 synthetic Emerging AI-First Industries scenarios — JSONL — CC-BY-4.0
View on Hugging Face
ai_agent_and_automation_dataset_v1_jsonl
100 synthetic AI Agent & Automation scenarios — JSONL — CC-BY-4.0
View on Hugging Face

How to use

# Python example (datasets library)
from datasets import load_dataset
ds = load_dataset("vnovaai/INDIA_FRAUD_DETECTION_JSONL_V1")
print(ds["train"][0])