Profit from scraper lfu-rlp and integrate further federal institutions of Rheinland-Pfalz (!879) · Merge requests · umwelt-info / metadaten

Websites of RLP that are structured similarly to lfu.rlp.de:

The website bildung.rlp.de was not included because environmental and nature conservation topics are barely represented in its sitemap.

Several minor issues still have to be solved:

Some titles contain too much information ("Alle Badeseen", "Aktuelle Projekte SNU")
Some titles on the detail page are more informative than the title in the sitemap (e.g. Fischotter [SNU])
More pages than are actually available are scraped for the category "Aktuelles"
SNU has empty pages in "Links und Downloads"
Track PDF error as unknown log message

Another task that has to be done separately:

fawf.wald.rlp.de (merges and aggregations are necessary here and should be handled in a dedicated scraper)

Edited May 28, 2025 by Stefan Krämer
