Finding 2
Section-aware extraction makes the error category structurally unreachable
The document contains three business entities — Apex Staffing, Excel Manufacturing, and Franklin Logistics — before the Employee Information section that contains the claimant's name. Approach B scanned the full document; Approach C partitioned it.
Each regex pattern runs only against its assigned section pool. The first_name pattern sees only text under the Employee Information header. "Franklin" exists only in the Employer Information pool. The two pools never intersect. The attribution error is not a probability to manage — it is a structural impossibility.
Section partitioning (Python)
SECTION_MAP = {
"employer": re.compile(
r"employer\s+information", re.I),
"employee": re.compile(
r"(?:injured\s+)?employee\s+information", re.I),
"injury": re.compile(
r"(?:injury|incident)\s+(?:information|details)", re.I),
}
# first_name runs in "employee" pool only.
# "Franklin" exists in "employer" pool only.
# No overlap. Attribution error is impossible.