Scraper for Environmental Innovation Programme
The Environmental Innovation Programme (website only available in German) funds industrial-scale pilot projects in key environmental sectors such as climate protection and resource efficiency. Its website contains information for potential applicants as well as information about ongoing projects that are receiving funding; both are to be indexed.
Contents that require scraping:
- All information provided in the section "Über uns" (About Us). This includes the subcategories funding priorities ("Förderschwerpunkte"), news articles ("Neuigkeiten"), and events ("Veranstaltungen"). The milestones ("Meilensteine", subcategory "Chronik") can be ignored, as this information is also covered by the search index on funded projects.
- All information provided in the section "Förderinformationen" (Funding Information). This includes the quiz-like "Quickcheck" and all downloadable documents.
- All 288 entries in the search index provided in the section "Geförderte Projekte" (Funded Projects).
Acceptance Criteria:
- Scraper for Environmental Innovation Programme merged into main branch and deployed to md.umwelt.info.
- One dataset for the main content in the section "Über uns" as well as one dataset per sub-entry (individual funding priorities, news articles, and events).
- One dataset for the main content in the section "Förderinformationen", one dataset for the Quickcheck, and one dataset per downloadable document.
- One dataset for each entry of the search index under "Geförderte Projekte".
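
The dataset mapping in the acceptance criteria can be sketched as a small planning function, which may help when checking that a scraper run produced the expected number of datasets. This is only an illustrative sketch: `PlannedDataset`, `plan_datasets`, the `kind` labels, and the input lists are hypothetical names, not part of the existing codebase, and the actual scraper may be structured quite differently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlannedDataset:
    section: str  # website section the dataset belongs to
    kind: str     # hypothetical label: "main", "sub-entry", "quickcheck", "document", "project"
    source: str   # identifier of the page or file the dataset is built from


def plan_datasets(about_entries, documents, projects):
    """Return one PlannedDataset per item named in the acceptance criteria.

    about_entries: funding priorities, news articles and events under "Über uns"
    documents:     downloadable documents under "Förderinformationen"
    projects:      entries of the search index under "Geförderte Projekte"
    """
    planned = [PlannedDataset("Über uns", "main", "Über uns")]
    planned += [PlannedDataset("Über uns", "sub-entry", e) for e in about_entries]
    planned.append(PlannedDataset("Förderinformationen", "main", "Förderinformationen"))
    planned.append(PlannedDataset("Förderinformationen", "quickcheck", "Quickcheck"))
    planned += [PlannedDataset("Förderinformationen", "document", d) for d in documents]
    planned += [PlannedDataset("Geförderte Projekte", "project", p) for p in projects]
    return planned
```

Under these assumptions, the total is three fixed datasets (two main pages and the Quickcheck) plus one per sub-entry, document, and project, so a run over the 288 indexed projects should yield at least 291 datasets.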