Is it possible to use H2O on databricks without pysparkling? #16401

Open
matt7salomon opened this issue Sep 25, 2024 · 2 comments

@matt7salomon

I am trying to use H2O on my Databricks cluster, mainly to use the 64 GB CUDA GPU that I have. I don't want to convert my datasets to Spark datasets, though, as everything else is in pandas. Is it possible to run H2O on Databricks and use the GPU without pysparkling? If so, what IP address do I use in h2o.init(ip=...)? I did try this, and the H2O datasets appear to be filled with all nulls.

@krasinski
Member

Hello @matt7salomon, we haven't really tried this.
Would there be any benefit to using H2O on Databricks without Spark?
Are you able to use the compute without Databricks? That's usually possible in cloud environments, and it is probably more cost-effective too.

@matt7salomon
Author

Yes. I found out how to install it on the cluster, and it works. I just need to open a terminal, download the h2o.jar, and move it to the directory it is looking in
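For anyone landing here with the same question, below is a minimal sketch of the pandas-only route. It is not taken from the thread. It assumes the `h2o` Python package and a Java runtime are available on the node running the notebook, so that `h2o.init` can connect to, or start, an instance on `localhost` at H2O's default port 54321. The helper names `pandas_to_h2o` and `columns_with_new_nulls` are hypothetical, and the null check is only a guard against the "all nulls" symptom described above, not a diagnosis of its cause.

```python
def columns_with_new_nulls(before, after):
    """Return the columns whose missing-value count grew during the upload.

    `before` and `after` map column name -> number of missing values.
    """
    return [col for col in before if after.get(col, 0) > before[col]]


def pandas_to_h2o(df, ip="localhost", port=54321):
    """Upload a pandas DataFrame to H2O without going through Spark.

    Assumption: H2O runs on the same node as this code, so `localhost`
    is the address to use. Adjust `ip` and `port` if your setup differs.
    """
    import h2o  # imported lazily so the helper above works without h2o

    # Connects to an instance at ip:port, or starts a local one if none is found.
    h2o.init(ip=ip, port=port)

    frame = h2o.H2OFrame(df)

    # Compare per-column missing-value counts before and after the upload.
    before = {str(col): int(n) for col, n in df.isna().sum().items()}
    after = dict(zip(frame.columns, (int(n) for n in frame.nacnt())))

    suspicious = columns_with_new_nulls(before, after)
    if suspicious:
        raise ValueError(f"Columns gained nulls during upload: {suspicious}")
    return frame
```

Calling `pandas_to_h2o(my_df)` from a notebook cell would then return an `H2OFrame`, or raise if any column came back with more nulls than it had in pandas.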
