weather-mv: Support streaming conversions to BigQuery. #51

Closed
alxmrs opened this issue Dec 29, 2021 · 0 comments · Fixed by #58
Collaborator

alxmrs commented Dec 29, 2021

Use Case

As a user, I want to load real-time forecasts from cloud storage into BigQuery, using the simplest and most computationally efficient means available. For example, ECMWF disseminates data four times per day into cloud storage buckets. Once an upload event occurs, I want the data to start ingesting into BigQuery automatically.

Proposal

Support a new argument, --topic, that lets users specify a GCP PubSub topic that receives CREATE or FINALIZE events from a GCS bucket. Either this argument or --uris will be required. Using --topic will deploy a streaming Dataflow job that subscribes to the PubSub events, windows them into 1-minute windows, and performs the extract_rows operation on the incoming data.
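
To make the shape of the proposal concrete, here is a minimal sketch in plain Python (standard library only, no Beam or Dataflow). It is not the weather-mv implementation. It assumes that --topic and --uris are mutually exclusive (the proposal only says one of them is required), that each notification payload is a JSON object with "bucket" and "name" fields, and that events arrive as (timestamp, payload) pairs. The helper names parse_args, event_to_uri, and group_by_window are hypothetical.

```python
import argparse
import json
from collections import defaultdict

WINDOW_SECONDS = 60  # the 1-minute window from the proposal


def parse_args(argv):
    """Require exactly one data source: batch --uris or streaming --topic."""
    parser = argparse.ArgumentParser(prog="weather-mv")
    # Assumption: the flags are mutually exclusive; the proposal only
    # states that one of them must be given.
    source = parser.add_mutually_exclusive_group(required=True)
    source.add_argument("--uris", help="URI pattern of files to load (batch).")
    source.add_argument("--topic", help="PubSub topic of bucket events (streaming).")
    return parser.parse_args(argv)


def event_to_uri(payload):
    """Turn a notification payload into a gs:// URI.

    Assumption: the payload is JSON with "bucket" and "name" fields.
    """
    obj = json.loads(payload)
    return "gs://{}/{}".format(obj["bucket"], obj["name"])


def group_by_window(events):
    """Group (timestamp_seconds, payload) pairs into fixed 1-minute windows.

    Returns a dict mapping each window's start time (in seconds) to the
    list of URIs whose events fell inside that window.
    """
    windows = defaultdict(list)
    for timestamp, payload in events:
        start = int(timestamp // WINDOW_SECONDS) * WINDOW_SECONDS
        windows[start].append(event_to_uri(payload))
    return dict(windows)
```

In a real streaming job, the grouping step would be handled by the pipeline's own fixed-window primitive, and each URI in a window would then be passed to extract_rows.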

Note: for other clouds, users can bridge other notification systems to GCP's PubSub in the short term. Apache Kafka support can come later.
