New feature required- Support different data sources #1044

Open
moheikal79 opened this issue Sep 22, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@moheikal79

Hi,
Currently daal4py accepts only CSV files, NumPy arrays, and pandas DataFrames. For data that is too large to fit into memory and is stored in a database, other tools have emerged, such as polars DataFrames, which use connectorx to read data stored in databases, and ibis, which uses databases such as ClickHouse as a backend. It would be great if daal4py could accept these additional data sources in order to handle larger data sizes.

Here is the error I got when I tried to read a ClickHouse table into daal4py using ibis:
Got type 'ClickhouseTable' when expecting string, array, or list of 1d-arrays.

Here is the error I got when I tried to read a polars DataFrame into daal4py:
Got type 'DataFrame' when expecting string, array, or list of 1d-arrays.
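
Until such inputs are accepted directly, one possible workaround is to materialize the object into something daal4py already takes. The helper below is only a minimal sketch and not part of daal4py: `materialize` is a hypothetical name, and it assumes that ibis table expressions expose `execute()` (returning a pandas DataFrame) and that polars DataFrames expose `to_numpy()`. It loads the full result into memory, so it does not address the larger-than-memory case.

```python
from typing import Any


def materialize(data: Any) -> Any:
    """Best-effort conversion to an input type daal4py already accepts.

    Hypothetical helper for illustration; relies on duck typing only,
    so neither ibis nor polars has to be imported here.
    """
    # Assumption: ibis table expressions provide execute(), which runs the
    # query on the backend and returns a pandas DataFrame.
    execute = getattr(data, "execute", None)
    if callable(execute):
        return execute()

    # Assumption: polars DataFrames provide to_numpy(), which returns a
    # NumPy array.
    to_numpy = getattr(data, "to_numpy", None)
    if callable(to_numpy):
        return to_numpy()

    # Anything else (CSV path string, NumPy array, list of 1d-arrays)
    # is passed through unchanged.
    return data
```

The result can then be passed wherever daal4py currently expects a string, array, or DataFrame.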

Regards
Mohamed

moheikal79 added the bug label Sep 22, 2022
Alexsandruss added the enhancement label and removed the bug label Sep 23, 2022
@napetrov
Contributor

We are considering adding support for different dataframe types through the dataframe interchange protocol: https://data-apis.org/dataframe-protocol/latest/purpose_and_scope.html
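
For readers unfamiliar with the protocol, here is a rough sketch of how a consumer might use it. This is an illustration under stated assumptions, not how daal4py implements or will implement it: `from_interchange` is a hypothetical name, and the sketch relies on `pandas.api.interchange.from_dataframe`, which is available in pandas 1.5 and later, to turn any object exposing `__dataframe__` into a pandas DataFrame.

```python
from typing import Any


def from_interchange(obj: Any, allow_copy: bool = True) -> Any:
    """Convert a protocol-compliant dataframe to a pandas DataFrame.

    Hypothetical helper for illustration only.
    """
    # Producers that support the interchange protocol expose __dataframe__.
    if not hasattr(obj, "__dataframe__"):
        raise TypeError(
            f"Got type '{type(obj).__name__}', which does not implement "
            "the dataframe interchange protocol."
        )

    # Imported lazily so the check above works even without pandas installed.
    from pandas.api.interchange import from_dataframe  # pandas >= 1.5

    return from_dataframe(obj, allow_copy=allow_copy)
```

A library that accepts pandas DataFrames could in principle call a helper like this at its input boundary, so that any protocol-compliant dataframe is handled without a hard dependency on the producing library.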
