Public Datasets

Explore VNOVA AI datasets hosted on Hugging Face. Use them for research, RAG, fine-tuning, or safety experiments.

INDIA_FRAUD_DETECTION_JSONL_V1
100 synthetic fraud & scam scenarios — JSONL — CC-BY-4.0
View on Hugging Face
LEGAL_ASSISTANT_DATASET_JSONL_INDIA
60 synthetic legal & crime-safety scenarios — JSONL — CC-BY-4.0
View on Hugging Face
EMOTIONAL_SUPPORT_DATASET_V1
100 synthetic emotional support — JSONL — CC-BY-4.0
View on Hugging Face
CYBERSECURITY_DATASET_V1
100 synthetic cybersecurity scenarios — JSONL — CC-BY-4.0
View on Hugging Face
DECISION_MAKING_ASSISTANT_DATASET_V1_JSONL
100 synthetic decision making scenarios — JSONL — CC-BY-4.0
View on Hugging Face
CUSTOMER_SUPPORT_DATASET_JSONL_V1
100 synthetic customer support scenarios — JSONL — CC-BY-4.0
View on Hugging Face

How to use

# Python example (datasets library)
from datasets import load_dataset
ds = load_dataset("vnovaai/INDIA_FRAUD_DETECTION_JSONL_V1")
print(ds["train"][0])