-
Notifications
You must be signed in to change notification settings - Fork 127
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Newer versions of the library on maven central #96
Comments
Unfortunately, we don't have any plans to upgrade the current Spark version, but we are always re-evaluating our roadmap based on customer feedback! |
Thanks for that Lauren! Trying to understand it. Is Spark-SageMaker on the roadmap and would you recommend users to continue building solutions on it? Is there something else you would like us to go with when integrating with SageMaker especially when running jobs on EMR? |
Please refer to our documentation for Spark support. https://docs.aws.amazon.com/sagemaker/latest/dg/apache-spark.html We are evaluating our roadmap and will add support for the latest version in the future. |
Thanks a lot Nadia! |
I had issues running sagemaker_pyspark on EMR 5.22 per this closed issue. I was able to have it work with no issue and confirm this with an AWS tech support. The changes I had to apply are listed in my comments in the closed issue linked above. Figured I'd also post here in case it can benefit anyone else. One question though. It appears that sagemaker_pyspark SDK is not updated as often as sagemaker python SDK. Should we not be concerned because sagemaker_pyspark is a wrapper for sagemaker python SDK; or is it indeed lower priority in your roadmap and therefore receives less support? |
System Information
Describe the problem
EMR clusters which use spark 2.3 and later have newer versions of sagemaker spark jars.
However they are not available on maven central : https://mvnrepository.com/artifact/com.amazonaws/sagemaker-spark
When is the plan to release to maven central for spark 2.3 and later? Or any recommendations for running on later EMR versions of the cluster.
Minimal repo / logs
The text was updated successfully, but these errors were encountered: