
Added code to allow users to specify inference parameters for Huggingface models using HuggingfaceInferenceAPI #15680

Draft
wants to merge 5 commits into base: main
Conversation


@AkashParua commented Aug 27, 2024

Description

This PR relates to issue #15659: it adds code to allow users to specify inference parameters for Huggingface models using HuggingfaceInferenceAPI. Refer to link to see how this code integrates with Huggingface.

Fixes #15659

New Package?

Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?

  • Yes
  • No

Version Bump?

Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)

  • Yes
  • No

Type of Change

Please delete options that are not relevant.

  • New feature (non-breaking change which adds functionality)

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration.

  • I stared at the code and made sure it makes sense

Suggested Checklist:

  • I have performed a self-review of my own code

@dosubot dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Aug 27, 2024
Review comment on this snippet from the diff:

    "The value of the stop parameter to use when generating text. This parameter"
    " is used to stop generating text when one of the specified tokens is generated."
    ),
    )
Collaborator

None of these parameters are used though? Seems like there needs to be some function to get model kwargs when using the client?

Author

Yes! Sorry for the oversight. Correct me if I am wrong — see link — here we can add these extra arguments. I am new to open-source contributions and would really appreciate some guidance.

Collaborator

Hmm, not in the metadata, but I think these options need to be passed when using the clients, like here

output: "ConversationalOutput" = self._sync_client.conversational(

Probably we want a helper function to gather up the kwargs; then, each time a client is used, that function can collect all the kwargs and pass them along.
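The helper suggested above could be sketched roughly as follows. This is a minimal, hypothetical illustration, not the PR's actual implementation: the class name and the `temperature`/`top_k` parameters are assumptions (only `stop` appears in the diff snippet), and the real class would subclass the llama-index LLM base class rather than plain `object`.

```python
# Hypothetical sketch: collect user-supplied inference parameters into one
# kwargs dict, skipping unset (None) values, so every client call can
# forward them uniformly. Names are illustrative assumptions.
from typing import Any, Dict, List, Optional


class HuggingFaceInferenceAPISketch:
    """Minimal stand-in for the LLM class, holding optional inference params."""

    def __init__(
        self,
        temperature: Optional[float] = None,
        top_k: Optional[int] = None,
        stop: Optional[List[str]] = None,
    ) -> None:
        self.temperature = temperature
        self.top_k = top_k
        self.stop = stop

    def _get_inference_kwargs(self) -> Dict[str, Any]:
        """Gather all non-None inference parameters into one dict."""
        candidates = {
            "temperature": self.temperature,
            "top_k": self.top_k,
            "stop": self.stop,
        }
        return {k: v for k, v in candidates.items() if v is not None}


llm = HuggingFaceInferenceAPISketch(temperature=0.7, stop=["</s>"])
print(llm._get_inference_kwargs())
# → {'temperature': 0.7, 'stop': ['</s>']}
```

Each place a client is invoked (e.g. `self._sync_client.conversational(...)`) would then splat the result in, along the lines of `self._sync_client.conversational(..., **self._get_inference_kwargs())`, so unset parameters are simply omitted from the request.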

Author

Working on it! Thanks!

@logan-markewich logan-markewich self-assigned this Aug 28, 2024
@AkashParua AkashParua marked this pull request as draft August 29, 2024 21:24
Successfully merging this pull request may close these issues.

[Feature Request]: Specify inference parameters for Huggingface models using HuggingfaceInferenceAPI