JSON mode #1527

pseudotensor · 2024-04-02T21:32:34Z

example guided_json, guided_regex, guided_choice schemas to be passed in as string to h2oGPT.

TEST_SCHEMA = {
    "type": "object",
    "properties": {
        "name": {
            "type": "string"
        },
        "age": {
            "type": "integer"
        },
        "skills": {
            "type": "array",
            "items": {
                "type": "string",
                "maxLength": 10
            },
            "minItems": 3
        },
        "work history": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "company": {
                        "type": "string"
                    },
                    "duration": {
                        "type": "string"
                    },
                    "position": {
                        "type": "string"
                    }
                },
                "required": ["company", "position"]
            }
        }
    },
    "required": ["name", "age", "skills", "work history"]
}

TEST_REGEX = (r"((25[0-5]|(2[0-4]|1\d|[1-9]|)\d)\.){3}"
              r"(25[0-5]|(2[0-4]|1\d|[1-9]|)\d)")

TEST_CHOICE = [
    "Python", "Java", "JavaScript", "C++", "C#", "PHP", "TypeScript", "Ruby",
    "Swift", "Kotlin"
]

pseudotensor · 2024-04-03T01:11:31Z

pseudotensor · 2024-04-03T01:16:46Z

pseudotensor · 2024-04-03T01:16:55Z

pseudotensor · 2024-04-03T07:10:15Z

Working for all except Cohere, which should be native vLLM json

…ust contain the word 'json' in some form, to use 'response_format' of type 'json_object'.", 'type': 'invalid_request_error', 'param': 'messages', 'code': None}} and update test

pseudotensor added 12 commits April 2, 2024 14:31

guided_json for vllm and openai use

62b3d8c

Update packages

3fad044

Add API parameters for json mode

a043e60

langchain migration to langchain_community

cc265fd

Fix mistake in migration

811702b

Handle vllm non-chat API and add test

3ad2457

Fix response_format for extra_body

e5791ca

Fix string

5530a84

Transcribe

a6cd99a

Handle langchain path for response_format and guided_json et al.

1911843

Deal with json responses for non ops

a8667f2

Adjust test for no docs to summarize

5d4336f

pseudotensor added 3 commits April 2, 2024 23:21

Handle streaming JSON or from code block. Add tests

706dc94

Always try to get json, e.g. in case vllm is old

b40d46d

Handle old vLLMs that don't have version or json mode

16927de

pseudotensor added 11 commits April 3, 2024 00:42

Fix real vllm json but when no guided_json

3a26717

Update test

7d42695

Fix is_json_model use

1ebdbab

Properly deal with vllm with json but no guided_json

322f5e6

schema vs. properties of it for prompt

0174634

Remove debug

178a046

For ValueError: Error code: 400 - {'error': {'message': "'messages' m…

6c9005d

…ust contain the word 'json' in some form, to use 'response_format' of type 'json_object'.", 'type': 'invalid_request_error', 'param': 'messages', 'code': None}} and update test

Update test

54614fc

Test fixes and dict vs. json fix

f5289ce

Test fixes and dict vs. json fix

f7d4863

Deal with corners like Capy made

886e0ee

pseudotensor added 6 commits April 3, 2024 04:15

Update test

59f2b49

Update test

8465623

Gemma messes up json with missing }

36e28d8

Deal with mistral-large adding json tag inside json

439c6f1

Update test

e44211a

Protection

79e0235

pseudotensor marked this pull request as ready for review April 3, 2024 16:30

pseudotensor added 2 commits April 3, 2024 09:32

Add get_json for other non-streaming cases

6d06fd2

Deal with vllm_extra_dict for vllm_chat

4832389

pseudotensor changed the title ~~guided_json for vllm and openai use~~ JSON mode Apr 3, 2024

pseudotensor merged commit 84fcce1 into main Apr 3, 2024
2 checks passed

pseudotensor deleted the guided_json branch April 3, 2024 16:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JSON mode #1527

JSON mode #1527

pseudotensor commented Apr 2, 2024 •

edited

Loading

pseudotensor commented Apr 3, 2024

pseudotensor commented Apr 3, 2024

pseudotensor commented Apr 3, 2024

pseudotensor commented Apr 3, 2024

JSON mode #1527

JSON mode #1527

Conversation

pseudotensor commented Apr 2, 2024 • edited Loading

pseudotensor commented Apr 3, 2024

pseudotensor commented Apr 3, 2024

pseudotensor commented Apr 3, 2024

pseudotensor commented Apr 3, 2024

pseudotensor commented Apr 2, 2024 •

edited

Loading