Endless loop when mapping entities to tokens: Nerwhal

Uploading the PDF (An5_Jahresbericht_CMB.pdf) fails and the app gets stuck in an endless loading loop.

[An5_Jahresbericht_CMB.pdf](/uploads/ad18824fbaf4965569d9023e4e50e1b5/An5_Jahresbericht_CMB.pdf)

Error message:

    ...

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/nerwhal/entity_aligner.py", line 34, in _set_start_tok
        ent.start_tok = self._char_to_token_idx[ent.start_char]
                        ~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^
    KeyError: 152809

    ...

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/nerwhal/entity_aligner.py", line 34, in _set_start_tok
        ent.start_tok = self._char_to_token_idx[ent.start_char]
                        ~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^
    KeyError: 152719

    During handling of the above exception, another exception occurred:

    Traceback (most recent call last):
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/uvicorn/protocols/http/h11_impl.py", line 403, in run_asgi
        result = await app(  # type: ignore[func-returns-value]
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/uvicorn/middleware/proxy_headers.py", line 60, in __call__
        return await self.app(scope, receive, send)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/fastapi/applications.py", line 1054, in __call__
        await super().__call__(scope, receive, send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/applications.py", line 113, in __call__
        await self.middleware_stack(scope, receive, send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/middleware/errors.py", line 186, in __call__
        raise exc
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/middleware/errors.py", line 164, in __call__
        await self.app(scope, receive, _send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/middleware/cors.py", line 93, in __call__
        await self.simple_response(scope, receive, send, request_headers=headers)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/middleware/cors.py", line 144, in simple_response
        await self.app(scope, receive, send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/middleware/exceptions.py", line 63, in __call__
        await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
        raise exc
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
        await app(scope, receive, sender)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/routing.py", line 716, in __call__
        await self.middleware_stack(scope, receive, send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/routing.py", line 736, in app
        await route.handle(scope, receive, send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/routing.py", line 290, in handle
        await self.app(scope, receive, send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/routing.py", line 78, in app
        await wrap_app_handling_exceptions(app, request)(scope, receive, send)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
        raise exc
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
        await app(scope, receive, sender)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/starlette/routing.py", line 75, in app
        response = await f(request)
                   ^^^^^^^^^^^^^^^^
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/fastapi/routing.py", line 302, in app
        raw_response = await run_endpoint_function(
                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/fastapi/routing.py", line 213, in run_endpoint_function
        return await dependant.call(**values)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/app/endpoints.py", line 223, in find_piis
        res = nerwhal.recognize(
              ^^^^^^^^^^^^^^^^^^
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/nerwhal/core.py", line 128, in recognize
        entity_aligner.align_entities_with_tokens(ents, tokens)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/nerwhal/entity_aligner.py", line 20, in align_entities_with_tokens
        self._set_tokens_in_entities(ents)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/nerwhal/entity_aligner.py", line 28, in _set_tokens_in_entities
        self._set_start_tok(ent)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/nerwhal/entity_aligner.py", line 38, in _set_start_tok
        self._set_start_tok(ent)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/nerwhal/entity_aligner.py", line 38, in _set_start_tok
        self._set_start_tok(ent)
      File "/home/kwiederhold/OpenRedact/openredact-app/backend/.venv/lib/python3.12/site-packages/nerwhal/entity_aligner.py", line 38, in _set_start_tok
        self._set_start_tok(ent)
      [Previous line repeated 963 more times]
    RecursionError: maximum recursion depth exceeded
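
Possible direction for a fix (a sketch only, not checked against the nerwhal source): the traceback shows `_set_start_tok` calling itself from inside the `KeyError` handler, and the two `KeyError` values differ, which suggests the lookup is retried with a shifted character offset on every miss. If the nearest token start is more than roughly 1000 characters away, that retry chain would exhaust Python's recursion limit. A non-recursive lookup over sorted token start offsets avoids this regardless of how large the gap is. The function name `char_to_token_index` and the sample offsets below are hypothetical and only serve as illustration.

```python
from bisect import bisect_right


def char_to_token_index(token_starts, char_pos):
    """Return the index of the last token that starts at or before char_pos.

    token_starts must be a list of token start offsets sorted ascending.
    Hypothetical helper for illustration; not part of nerwhal.
    """
    # bisect_right gives the insertion point to the right of any equal
    # entries, so subtracting 1 yields the last start offset <= char_pos.
    idx = bisect_right(token_starts, char_pos) - 1
    if idx < 0:
        raise ValueError(f"char position {char_pos} lies before the first token")
    return idx


# Assumed sample data: a gap of far more than 1000 characters between tokens,
# which a one-character-per-call recursive retry could not bridge.
token_starts = [0, 5, 11, 150000]

exact = char_to_token_index(token_starts, 11)       # offset is a token start
in_gap = char_to_token_index(token_starts, 149999)  # offset inside the gap
print(exact, in_gap)
```

Both lookups resolve to the token starting at offset 11 in a single binary search, so the cost does not depend on the size of the gap. Whether snapping to the preceding token is the right behavior for entities that start between tokens is an assumption that would need checking against how nerwhal tokenizes the PDF text.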