Skip to content

Possible to fine tune on 2 different dataset classes? #1509

Answered by RdoubleA
troy256 asked this question in Q&A
Discussion options

You must be logged in to vote

Hey @troy256, take a look at torchtune.datasets._concat.ConcatDataset, it does just what you described.

You can use it directly from the config for any recipe by specifying a list for your dataset:

dataset:
  - _component_: torchtune.datasets.instruct_dataset
    ...
  - _component_: torchtune.datasets.text_completion_dataset

Let us know if you run into any issues.

Replies: 4 comments 4 replies

Comment options

You must be logged in to vote
0 replies
Answer selected by joecummings
Comment options

You must be logged in to vote
1 reply
@RdoubleA
Comment options

RdoubleA Sep 9, 2024
Collaborator

Comment options

You must be logged in to vote
1 reply
@RdoubleA
Comment options

RdoubleA Sep 9, 2024
Collaborator

Comment options

You must be logged in to vote
2 replies
@RdoubleA
Comment options

@troy256
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants