Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dockerfile for ROCm #311

Draft
wants to merge 4 commits into
base: main
Choose a base branch
from
Draft

Dockerfile for ROCm #311

wants to merge 4 commits into from

Conversation

HPPinata
Copy link

@HPPinata HPPinata commented Oct 6, 2024

This is a first Draft of a usable Dockerfile for AMD ROCm cards.

Based on my testing it currently only works with the raw-png branch, probably due to the newer dependencies (9.0.7 on AMD is also broken with micromamba).

The flash_attn handling is also not that pretty, but I'm not sure how to improve it. Feedback is especially welcome here.

I'll also have to do some performance testing to see whether there's any negative impact, and if newer ROCm versions (6.1.2) or even potentially mismatched versions (6.0 on 6.2 or 6.1 on 6.2) improve things.

There might also STILL be some memory leaks during model load, but using Docker and venv compard to micromamba makes bisecting easier.

apparently the build caching works a lot better with a plain venv, so apart from a few seconds refreshing repos this step only takes time on first install anyways.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant