worksforme model training task is failing (probably OOM) #4095

Closed
marco-c opened this issue Mar 12, 2024 · 3 comments · Fixed by #4096

Comments

@marco-c
Collaborator

marco-c commented Mar 12, 2024

https://community-tc.services.mozilla.com/tasks/TOyLl-ubQwehM8liNwtRdA/runs/0

@PromiseFru
Collaborator

> https://community-tc.services.mozilla.com/tasks/TOyLl-ubQwehM8liNwtRdA/runs/0

Oh, the training was successful on TC. Perhaps I should try the compute-large worker?

@marco-c
Collaborator Author

marco-c commented Mar 12, 2024

Yeah, let's try that!

@suhaibmujahid
Member

> Oh, the training was successful on TC. Perhaps I should try the compute-large worker?

The training on TC was using compute-large:

workerType: compute-large

However, the data pipeline is currently using compute-small:

workerType: compute-small

So changing it to compute-large should fix the issue.
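
For anyone who wants to catch this kind of mismatch earlier, here is a rough sketch of a check that flags tasks not running on the larger worker. The task IDs, the dict layout, and the `find_undersized_tasks` helper are all hypothetical and only for illustration; the real task definitions in the repo may be structured differently.

```python
# Hypothetical helper, not part of the project: lists tasks whose
# workerType differs from the one the training job is assumed to need.
def find_undersized_tasks(tasks, required="compute-large"):
    """Return the IDs of tasks whose workerType is not `required`."""
    return [task["ID"] for task in tasks if task.get("workerType") != required]


# Illustrative task definitions; the IDs and structure are assumptions.
tasks = [
    {"ID": "train-on-tc", "workerType": "compute-large"},
    {"ID": "train-in-data-pipeline", "workerType": "compute-small"},
]

undersized = find_undersized_tasks(tasks)
print(undersized)
```

With the made-up definitions above, only the data pipeline entry is reported, which matches the diagnosis in this thread.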
