feat(kernel-memory): avoid loading model twice. #248

AsakusaRinne · 2023-11-05T09:19:09Z

@xbotter Please help to review it. Thank you!

xbotter

LGTM.
I noticed that there is an EmbeddingMode in ModelParams. Will it have any impact on the embedding?

martindevans · 2023-11-05T12:46:02Z

I'm not certain, but I think embedding mode loads the model in a way that it can only produce embeddings.

AsakusaRinne · 2023-11-05T13:35:28Z

Martin's right. One of the reasons behind it may be the KV cache: when a model is used only in embedding mode, there seems to be no need for a KV cache. What confuses me, though, is why a model loaded for inference cannot also be used to generate embeddings.

feat(kernel-memory): avoid loading model twice.

46f01bb

xbotter approved these changes Nov 5, 2023

AsakusaRinne merged commit a9434c2 into SciSharp:master Nov 5, 2023
6 checks passed
