Feature/lucene search engine #2892

Open
wants to merge 95 commits into
base: main
from

Conversation

yanlibert
Contributor

Problem

Currently, the new version of Marquez uses OpenSearch as the backend for the new search feature.
This might be overkill: it introduces an external dependency, yet only the search and indexing features of OpenSearch are used.

Solution !! Warning: Currently a WiP !!

This is a small Lucene-based implementation that only indexes and searches a dataset index and a job index. It is packaged as a subproject that can run alongside the Marquez API and marquez-web.
It's designed as a drop-in replacement for OpenSearch, so it's easy to switch between this implementation and a full-fledged OpenSearch.
It uses a ByteBuffersDirectory, so all documents are stored in memory. At startup, the datasets and jobs are reloaded in the background from the lineage_events table using the Marquez DAO.
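
To make the idea concrete for the discussion, here is a minimal, standard-library-only sketch of an in-memory index that is filled in the background at startup. It is not the code in this PR and it does not use Lucene: a plain map of tokens to names stands in for the Lucene index held in a ByteBuffersDirectory, and a hard-coded list stands in for the rows read from lineage_events through the Marquez DAO. The class name, method names, and sample dataset names are all hypothetical.

```java
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;

// Toy stand-in for an in-memory search index (hypothetical, not Lucene).
class InMemorySearchSketch {
    // token -> names of the indexed entities that contain that token
    private final Map<String, Set<String>> postings = new ConcurrentHashMap<>();

    // Split a dataset or job name into lowercase alphanumeric tokens and record each one.
    public void index(String name) {
        for (String token : name.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                postings.computeIfAbsent(token, t -> ConcurrentHashMap.newKeySet()).add(name);
            }
        }
    }

    // Exact-token lookup; returns a sorted copy so callers cannot mutate the index.
    public Set<String> search(String term) {
        return new TreeSet<>(postings.getOrDefault(term.toLowerCase(Locale.ROOT), Set.of()));
    }

    // Index the given names on a background thread, as a startup reload might.
    // In the PR the names would come from the database; here they are passed in directly.
    public CompletableFuture<Void> reloadAsync(List<String> names) {
        return CompletableFuture.runAsync(() -> names.forEach(this::index));
    }

    public static void main(String[] args) {
        InMemorySearchSketch datasets = new InMemorySearchSketch();
        // Made-up dataset names used only for illustration.
        datasets.reloadAsync(List.of("public.orders", "public.order_items", "public.customers"))
                .join(); // wait for the background load before querying
        System.out.println(datasets.search("orders"));
    }
}
```

Because everything lives on the heap, the index is lost on restart and has to be rebuilt from the database each time, which is why the reload step and memory management matter for this design.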

Note: At the time of opening this PR, this is only a PoC meant to open the discussion about creating this new Marquez component. As such, it still lacks some key elements (unit tests, integration tests, memory management, proper DB management, proper config, ...).

phixMe and others added 29 commits August 9, 2024 16:10
# Conflicts:
#	.env.example
#	docker-compose.web.yml
Signed-off-by: wslulciuc <[email protected]>
Signed-off-by: Yannick Libert <[email protected]>

netlify bot commented Sep 9, 2024

Deploy Preview for peppy-sprite-186812 canceled.

Name Link
🔨 Latest commit 5689a67
🔍 Latest deploy log https://app.netlify.com/sites/peppy-sprite-186812/deploys/66df15353dd9ab0008478562

3 participants