scrapers-for-journalists

Scrapers that help journalists retrieve data or monitor sites for potential story leads.

Using the scrapers

pip install scrapers_for_journalists==0.1.0

Then import a scraper, e.g. from domstoldk.retrieve import DomStolScrape

Every file in utils/ can be imported in your scrapers, because utils is added as a package in pyproject.toml. For example, you can import the BaseScraper, which provides generic utilities, with: from base import BaseScraper.

Description of current scrapers

domstol.dk

This scraper retrieves information about current court cases ("retslister") in the Danish district courts ("byretter"). Højesteret and other courts are currently not included. Civil cases and forced auctions ("tvangsauktioner") are filtered out. The relevance of each case is estimated from keywords and "gerningskoder" (codes for types of crimes) from the Danish Police.
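The filtering and relevance estimate described above could look roughly like the sketch below. It is an illustration only, not the scraper's actual implementation: the CourtCase class, the keyword weights, the crime code "99999", and the case type labels are all hypothetical, since the real lists are not shown in this README.

```python
from dataclasses import dataclass

# Hypothetical values for illustration only; the real keyword list and
# "gerningskoder" used by the scraper are not shown in this README.
KEYWORDS = {"vold": 2, "bedrageri": 3}
CRIME_CODES = {"99999": 5}
EXCLUDED_TYPES = {"civil", "tvangsauktion"}


@dataclass
class CourtCase:
    case_type: str          # hypothetical label, e.g. "straffesag" or "civil"
    description: str
    crime_code: str = ""


def relevance(case: CourtCase) -> int:
    """Sum the weights of matching keywords plus the weight of the crime code."""
    text = case.description.lower()
    score = sum(weight for word, weight in KEYWORDS.items() if word in text)
    return score + CRIME_CODES.get(case.crime_code, 0)


def rank_cases(cases: list[CourtCase]) -> list[tuple[int, CourtCase]]:
    """Drop excluded case types and sort the rest by descending relevance."""
    kept = [c for c in cases if c.case_type.lower() not in EXCLUDED_TYPES]
    scored = [(relevance(c), c) for c in kept]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

Calling rank_cases on a list of CourtCase objects returns (score, case) pairs with the most relevant cases first. A simple additive score is used here because it is easy to inspect; the actual scraper may weigh keywords and codes differently.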

To run it manually, use:

poetry run python domstoldk/retrieve.py --outfile test.xlsx
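For anyone writing a new scraper with a similar command line, an --outfile flag like the one above can be handled with argparse from the standard library. This is a minimal sketch, not the contents of retrieve.py: the parse_args function, the required flag, and the .xlsx suffix check are assumptions made for illustration.

```python
import argparse
from pathlib import Path


def parse_args(argv=None):
    """Parse the command line; parse_args is a hypothetical helper name."""
    parser = argparse.ArgumentParser(description="Retrieve data and write it to a file")
    parser.add_argument(
        "--outfile",
        type=Path,
        required=True,  # assumption: the real script may provide a default
        help="Path of the Excel file to write, e.g. test.xlsx",
    )
    args = parser.parse_args(argv)
    # Assumption: only .xlsx output is accepted in this sketch.
    if args.outfile.suffix.lower() != ".xlsx":
        parser.error("--outfile must end in .xlsx")
    return args
```

Passing argv explicitly, as in parse_args(["--outfile", "test.xlsx"]), makes the function easy to test without touching sys.argv.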

Repository: kristeligt-dagblad/scrapers_for_journalists
