Enable loading data sets from files for custom tasks #1083

davebiagioni · 2025-11-24T19:31:33Z

Purpose: Allow custom tasks to load datasets from local files, not only from the Hugging Face Hub. Useful for offline / air‑gapped / otherwise restricted environments

Changes

Config: Add optional hf_data_files to LightevalTaskConfig.
Loader: Forward hf_data_files as data_files to datasets.load_dataset in LightevalTask.download_dataset_worker.
Examples: Add examples/custom_tasks_templates/custom_yourbench_task_from_files.py.
Docs: Update docs/source/adding-a-custom-task.mdx with file-based usage.

Checklist

Tests pass locally
Pre-commit hooks pass locally
Added/updated documentation

HuggingFaceDocBuilderDev · 2025-12-04T14:54:26Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

NathanHB

Looking great ! few nits and good to merge if tests pass :)

docs/source/adding-a-custom-task.mdx

src/lighteval/tasks/lighteval_task.py

davebiagioni · 2025-12-04T19:31:47Z

@NathanHB review comments addressed. thanks!

enable use of data files for custom tasks

de830f6

davebiagioni mentioned this pull request Nov 24, 2025

Loading local data for custom tasks #681

Open

Merge branch 'main' into enable-data-files

d8cd81e

NathanHB reviewed Dec 4, 2025

View reviewed changes

docs/source/adding-a-custom-task.mdx Outdated Show resolved Hide resolved

docs/source/adding-a-custom-task.mdx Show resolved Hide resolved

src/lighteval/tasks/lighteval_task.py Outdated Show resolved Hide resolved

NathanHB mentioned this pull request Dec 4, 2025

[FT] Support local datasets #604

Closed

dbiagioni-plutoflume and others added 2 commits December 4, 2025 10:40

addressing PR comments, create new doc file, update docstring with types

cdebb52

Merge branch 'main' into enable-data-files

5396645

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enable loading data sets from files for custom tasks #1083

Enable loading data sets from files for custom tasks #1083

davebiagioni commented Nov 24, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 4, 2025

Uh oh!

NathanHB left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

davebiagioni commented Dec 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Enable loading data sets from files for custom tasks #1083

Are you sure you want to change the base?

Enable loading data sets from files for custom tasks #1083

Conversation

davebiagioni commented Nov 24, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 4, 2025

Uh oh!

NathanHB left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

davebiagioni commented Dec 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants