Add image support and show sources after upload #147

Merged May 19, 2023 (18 commits)

@pseudotensor commented May 18, 2023

============================================================================================= short test summary info =============================================================================================
FAILED tests/test_manual_test.py::test_chat_context - NotImplementedError: MANUAL TEST FOR NOW
FAILED tests/test_manual_test.py::test_upload_one_file - NotImplementedError: MANUAL TEST FOR NOW -- do and ask query of file
FAILED tests/test_manual_test.py::test_upload_multiple_file - NotImplementedError: MANUAL TEST FOR NOW -- do and ask query of files
FAILED tests/test_manual_test.py::test_upload_url - NotImplementedError: MANUAL TEST FOR NOW -- do and ask query of content of url
FAILED tests/test_manual_test.py::test_upload_pasted_text - NotImplementedError: MANUAL TEST FOR NOW -- do and see test code for what to try
=================================================================== 5 failed, 29 passed, 8 skipped, 1 xpassed, 10 warnings in 456.55s (0:07:36) ===================================================================
(h2ollm) jon@pseudotensor:~/h2o-llm$ 


[Three images attached]

@pseudotensor changed the title from "Add image support" to "Add image support and show sources after upload" on May 18, 2023
Separate image and non-image inputs to handle BLIP forking.
Relatedly, if a fast tokenizer is loaded and the process then forks, it complains and can hang, though not always.
It can hang when run from tests, so avoid the fast tokenizer there. In gradio there were no problems uploading a large zip with many PDFs, embedding them, and then processing more files in parallel.
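
A minimal sketch of how the image/non-image split described above might look. It is not the code from this PR: the extension set and both function names are hypothetical. `TOKENIZERS_PARALLELISM` is the environment variable that the Hugging Face tokenizers fork warning itself suggests setting; whether that alone avoids the hang seen here is an assumption.

```python
import os
from pathlib import Path

# Hypothetical extension set; the image types this PR actually handles are not listed.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".gif", ".bmp"}


def split_image_and_other(paths):
    """Return (image_paths, other_paths), preserving the input order."""
    images, others = [], []
    for p in paths:
        # Compare the lowercased suffix so "scan.PNG" counts as an image.
        if Path(p).suffix.lower() in IMAGE_EXTENSIONS:
            images.append(p)
        else:
            others.append(p)
    return images, others


def disable_tokenizer_parallelism():
    """Set the env var read by Hugging Face tokenizers; call before any fork."""
    os.environ["TOKENIZERS_PARALLELISM"] = "false"


disable_tokenizer_parallelism()
files = ["scan.PNG", "report.pdf", "notes.docx", "photo.jpg"]
images, others = split_image_and_other(files)
```

With the two lists separated, image files could go through captioning in one path while the remaining files are processed in parallel in another.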

Note: some easy PDFs can come out empty or as garbage.

Don't reload the chroma db after it has been created.

Don't pass an empty set of sources when making the db; chroma doesn't handle the empty case and just fails.
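
A minimal sketch of the empty-sources guard described above, assuming the sources are plain text strings. `make_db_or_none` and the `build_db` callback are hypothetical names standing in for whatever actually constructs the database; no real chroma API is called here.

```python
def make_db_or_none(sources, build_db):
    """Call build_db(sources) only when at least one non-empty source exists.

    Returns None instead of calling build_db with an empty list.
    """
    # Drop empty or whitespace-only texts, e.g. from PDFs that parsed to nothing.
    usable = [s for s in sources if s and s.strip()]
    if not usable:
        return None
    return build_db(usable)


# Stand-in builder for illustration: just reports how many sources it received.
db = make_db_or_none(["first document", "", "   "], build_db=len)
```

The caller then checks for `None` and skips the query or shows a message, rather than letting the db constructor fail.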

Fix docx/doc in list.

Avoid the upload file button; just rely on the upload box.
…ypes of inputs, including multiple file upload from gradio, not just single file.
@pseudotensor merged commit 5371c17 into main on May 19, 2023