Add scraping of deadlines and conference rankings #473

Open. a-nau wants to merge 5 commits into base: gh-pages

Conversation

a-nau (Contributor) commented Oct 9, 2022

I added automated scraping of conference deadlines (from WikiCFP) and conference rankings (from Core).

Currently, the output consists of two files (actually YAML, but GitHub requires a .txt extension for upload).

Some open points:

  • WikiCFP is licensed under CC BY-SA 3.0, so all deadline information taken from it would need to follow this license if we want to use it.
  • I could not find a license for the data scraped from Core.
  • The scraping can run inside the Docker container, so we could use GitHub Actions to automatically commit suggestions, i.e. the above-mentioned files. That way, people from the community could easily check the data as well. What do you think?
  • What is the "best" output format? I thought extra YAML files might be a good start, but maybe there are other ideas?
  • Any ideas on how best to match the conference master data with the scraping results?
  • Recent changes have been overwritten; conferences.yml needs to be checked before merging.
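
On the matching question, one possible starting point is plain fuzzy string matching on conference titles. The following is only a minimal sketch using Python's standard library. The function names `normalize` and `best_match`, the example titles, and the 0.8 threshold are hypothetical and are not taken from this PR or from conferences.yml.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not block a match."""
    return " ".join(name.lower().split())


def best_match(scraped_title, master_titles, threshold=0.8):
    """Return the master title most similar to scraped_title.

    Returns None if no candidate reaches the (assumed) similarity threshold.
    """
    best_title, best_score = None, 0.0
    for title in master_titles:
        # ratio() is a similarity score between 0.0 and 1.0
        score = SequenceMatcher(None, normalize(scraped_title), normalize(title)).ratio()
        if score > best_score:
            best_title, best_score = title, score
    return best_title if best_score >= threshold else None


# Hypothetical master titles; the real ones would be read from conferences.yml.
master = ["ICLR", "ICML", "NeurIPS"]
print(best_match("iclr", master))
print(best_match("Some Unrelated Workshop", master))
```

Short acronyms differ by only a character or two, so a threshold like this would probably need tuning, and ambiguous matches are better surfaced for manual review than merged automatically.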

Cheers, Alex

omarsar (Contributor) commented Nov 2, 2022

@a-nau thanks for this PR. I will review it by the end of the week.
