Add method to automatically determine content type based on HTTP headers (!504) · Merge requests · umwelt-info / metadaten

Adam Reichold requested to merge suche-st-fetch-doc into main Feb 12, 2024

This avoids parsing PDFs as HTML in the suche_st harvester. (At least after we deleted the three offending documents from the cache once.)
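
For illustration, here is a minimal sketch of how a content type could be derived from the `Content-Type` response header. It is not the code from this merge request: the `ContentKind` enum and the `content_kind` function are hypothetical names, and the sketch assumes the header value is already available as a string, using only the standard library.

```rust
/// Hypothetical classification of a fetched document
/// (an assumption for this sketch, not the harvester's actual type).
#[derive(Debug, PartialEq, Eq)]
pub enum ContentKind {
    Html,
    Pdf,
    Other,
}

/// Classify a document from the raw value of its `Content-Type` header.
/// `None` stands for a response that did not send the header at all.
pub fn content_kind(content_type: Option<&str>) -> ContentKind {
    let value = match content_type {
        Some(value) => value,
        // Without a header we cannot decide; the caller has to pick a fallback.
        None => return ContentKind::Other,
    };

    // Drop parameters such as `; charset=utf-8` and normalise the case,
    // since media types are compared case-insensitively.
    let media_type = value
        .split(';')
        .next()
        .unwrap_or("")
        .trim()
        .to_ascii_lowercase();

    match media_type.as_str() {
        "text/html" | "application/xhtml+xml" => ContentKind::Html,
        "application/pdf" => ContentKind::Pdf,
        _ => ContentKind::Other,
    }
}
```

With a helper along these lines, a response carrying `application/pdf` would be routed to PDF handling instead of the HTML parser. Documents cached before such a check existed would still need to be removed once, as noted above.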
