Apache OpenNLP: OOM DoS via Unbounded Array Allocation in AbstractModelReader

Summary

CVE	CVE-2026-42440
State	PUBLISHED
Assigner	apache
Source Priority	CVE Program / NVD first with legacy fallback
Published	2026-05-04 17:16:26 UTC
Updated	2026-07-03 13:17:12 UTC
Description	OOM Denial of Service via Unbounded Array Allocation in Apache OpenNLP AbstractModelReader

Versions Affected:
* before 1.9.5
* before 2.5.9
* before 3.0.0-M3

Description: The AbstractModelReader methods getOutcomes(), getOutcomePatterns(), and getPredicates() each read a 32-bit signed integer count field from a binary model stream and pass that value directly to an array allocation (new String[numOutcomes], new int[numOCTypes][], new String[NUM_PREDS]) without validating that the value is non-negative or within a reasonable bound. The count is therefore fully attacker-controlled when the model file originates from an untrusted source.

A crafted .bin model file in which any of these count fields is set to Integer.MAX_VALUE (or any value large enough to exhaust the available heap) triggers an OutOfMemoryError at the array allocation itself, before the corresponding label or pattern data is consumed from the stream. The error occurs very early in deserialization: for a GIS model, getOutcomes() is reached after only the model-type string, the correction constant, and the correction parameter have been read; so the attacker pays no meaningful size cost to weaponize a payload, and a single small file can crash a JVM that loads it.

Any code path that deserializes a .bin model is affected, including direct use of GenericModelReader and any higher-level component that delegates to it during model load. The practical impact is denial of service against processes that load model files from untrusted or semi-trusted origins.

Mitigation:
* 2.x users should upgrade to 2.5.9.
* 3.x users should upgrade to 3.0.0-M3.

Note: The fix introduces an upper bound on each of the three count fields, checked before array allocation; counts that are negative or exceed the bound cause an IllegalArgumentException to be thrown and the read to fail fast with no large allocation. The default bound is 10,000,000, which is well above the entry counts of legitimate OpenNLP models but far below any value that would threaten heap exhaustion. Deployments that legitimately need to load models with more entries than the default can raise the limit at JVM startup by setting the OPENNLP_MAX_ENTRIES system property to the desired positive integer (e.g. -DOPENNLP_MAX_ENTRIES=50000000); invalid or non-positive values fall back to the default.

Users who cannot upgrade immediately should treat all .bin model files as untrusted input unless their provenance is verified, and should avoid loading models supplied by end users or fetched from third-party repositories without integrity checks.
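
The bounded-count check described in the note can be sketched roughly as follows. This is a minimal illustration, not the actual OpenNLP patch: the class `BoundedCountSketch` and its methods `maxEntries`, `readCount`, `readLabels`, and `stream` are hypothetical names, and the stream layout (a count followed by UTF strings) is a simplification of the real model format. Only the OPENNLP_MAX_ENTRIES property name, the 10,000,000 default, and the IllegalArgumentException behavior come from the advisory text.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;

/** Illustrative only: hypothetical names, not the code shipped in OpenNLP. */
class BoundedCountSketch {

    // Default bound quoted in the advisory.
    static final int DEFAULT_MAX_ENTRIES = 10_000_000;

    /** Resolves the limit from the system property; invalid or non-positive values use the default. */
    static int maxEntries() {
        String raw = System.getProperty("OPENNLP_MAX_ENTRIES");
        if (raw == null) {
            return DEFAULT_MAX_ENTRIES;
        }
        try {
            int parsed = Integer.parseInt(raw.trim());
            return parsed > 0 ? parsed : DEFAULT_MAX_ENTRIES;
        } catch (NumberFormatException e) {
            return DEFAULT_MAX_ENTRIES;
        }
    }

    /** Reads a 32-bit count and rejects it before any array is allocated. */
    static int readCount(DataInputStream in, String field, int limit) throws IOException {
        int count = in.readInt();
        if (count < 0 || count > limit) {
            throw new IllegalArgumentException(
                field + " count " + count + " is outside [0, " + limit + "]");
        }
        return count;
    }

    /** Hypothetical stand-in for a reader method such as getOutcomes(). */
    static String[] readLabels(DataInputStream in, int limit) throws IOException {
        // The allocation happens only after the count has passed the bound check.
        String[] labels = new String[readCount(in, "outcome", limit)];
        for (int i = 0; i < labels.length; i++) {
            labels[i] = in.readUTF();
        }
        return labels;
    }

    /** Builds a tiny in-memory stream: a declared count followed by the given labels. */
    static DataInputStream stream(int declaredCount, String... labels) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);
        out.writeInt(declaredCount);
        for (String label : labels) {
            out.writeUTF(label);
        }
        return new DataInputStream(new ByteArrayInputStream(bytes.toByteArray()));
    }

    public static void main(String[] args) throws IOException {
        // A well-formed stream loads normally.
        System.out.println(readLabels(stream(2, "yes", "no"), maxEntries()).length);
        // A four-byte payload declaring Integer.MAX_VALUE entries is rejected up front.
        try {
            readLabels(stream(Integer.MAX_VALUE), maxEntries());
        } catch (IllegalArgumentException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Without the check in `readCount`, the second call would attempt `new String[Integer.MAX_VALUE]` from a four-byte input, which is the failure mode the advisory describes.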

Risk And Classification

Primary CVSS: v3.1 7.5 HIGH from ADP

CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H

EPSS: 0.006270000 probability, percentile 0.456940000 (date 2026-07-05)

Problem Types: CWE-789: Memory Allocation with Excessive Size Value | CWE-770: Allocation of Resources Without Limits or Throttling

Version	Source	Type	Score	Severity	Vector
3.1	ADP	DECLARED	7.5	HIGH	`CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H`
3.1	ADP	CVSS	7.5	HIGH	`CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H`
3.1	134c704f-9b21-4f2e-91b3-4a467353bcc0	Secondary	7.5	HIGH	`CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H`
3.1	0b0ca135-0b70-47e7-9f44-1890c2a1c46c	Secondary	7.5	HIGH	`CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H`

CVSS v3.1 Breakdown

Metric	Value
Attack Vector	Network
Attack Complexity	Low
Privileges Required	None
User Interaction	None
Scope	Unchanged
Confidentiality	None
Integrity	None
Availability	High
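
The 7.5 base score can be reproduced from the metric values listed above. The sketch below applies the CVSS v3.1 base-score equations for the Scope: Unchanged case, using the numeric metric weights from the CVSS v3.1 specification (AV:N = 0.85, AC:L = 0.77, PR:N = 0.85, UI:N = 0.85, C:N = 0, I:N = 0, A:H = 0.56). The class and method names are illustrative, and the changed-scope branch of the formula is deliberately left out.

```java
/** Illustrative calculator for the CVSS v3.1 base score, Scope: Unchanged only. */
class CvssBaseScoreSketch {

    /** Roundup as defined in the CVSS v3.1 specification: smallest one-decimal value >= input. */
    static double roundUp(double value) {
        long scaled = Math.round(value * 100_000);
        if (scaled % 10_000 == 0) {
            return scaled / 100_000.0;
        }
        return (Math.floor(scaled / 10_000.0) + 1) / 10.0;
    }

    /** Base score for an unchanged-scope vector, given the numeric metric weights. */
    static double baseScoreUnchangedScope(double av, double ac, double pr, double ui,
                                          double c, double i, double a) {
        double iss = 1 - (1 - c) * (1 - i) * (1 - a);      // impact sub-score
        double impact = 6.42 * iss;
        double exploitability = 8.22 * av * ac * pr * ui;
        if (impact <= 0) {
            return 0.0;
        }
        return roundUp(Math.min(impact + exploitability, 10.0));
    }

    public static void main(String[] args) {
        // AV:N  AC:L  PR:N  UI:N  C:N  I:N  A:H
        double score = baseScoreUnchangedScope(0.85, 0.77, 0.85, 0.85, 0.0, 0.0, 0.56);
        System.out.println(score); // prints 7.5
    }
}
```

Here the impact term is 6.42 × 0.56 ≈ 3.60 and the exploitability term is about 3.89, so the sum of roughly 7.48 rounds up to 7.5.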

NVD Known Affected Configurations (CPE 2.3)

Type	Vendor	Product	Version	Update	Edition	Language
Application	Apache	Opennlp	All	All	All	All
Application	Apache	Opennlp	3.0.0	m1	All	All
Application	Apache	Opennlp	3.0.0	m2	All	All

Vendor Declared Affected Products

Source	Vendor	Product	Version	Platforms
CNA	Apache Software Foundation	Apache OpenNLP	affected from 2.0 before 2.5.9 (semver)	Not specified
CNA	Apache Software Foundation	Apache OpenNLP	affected from 3.0.0-M1 before 3.0.0-M3 (semver)	Not specified
CNA	Apache Software Foundation	Apache OpenNLP	affected before 1.9.5 (semver)	Not specified
ADP	Red Hat	Red Hat Fuse 7	Not specified	Not specified
ADP	Red Hat	Red Hat JBoss Enterprise Application Platform Expansion Pack	Not specified	Not specified
ADP	Red Hat	Red Hat Data Grid 8	Not specified	Not specified
ADP	Red Hat	Red Hat OpenShift AI RHOAI	Not specified	Not specified

References

Reference	Source	Link	Tags
www.openwall.com/lists/oss-security/2026/05/01/21	af854a3a-2127-422b-91ae-364da2661108	www.openwall.com	Mailing List, Third Party Advisory
security.access.redhat.com/data/csaf/v2/vex/2026/cve-2026-42440.json	0b0ca135-0b70-47e7-9f44-1890c2a1c46c	security.access.redhat.com
lists.apache.org/thread/s8xlkx1gqbxfsq48py5h6jphjvgqp1jo	[email protected]	lists.apache.org	Mailing List, Vendor Advisory
bugzilla.redhat.com/show_bug.cgi	0b0ca135-0b70-47e7-9f44-1890c2a1c46c	bugzilla.redhat.com
access.redhat.com/security/cve/CVE-2026-42440	0b0ca135-0b70-47e7-9f44-1890c2a1c46c	access.redhat.com
CVE Program record	CVE.ORG	www.cve.org	canonical
NVD vulnerability detail	NVD	nvd.nist.gov	canonical, analysis

Vendor Comments And Credit

Discovery Credit

CNA: Subramanian S (en)

Additional Advisory Data

Source	Time	Event
ADP	2026-05-04T19:01:44.897Z	Reported to Red Hat.
ADP	2026-05-04T16:40:32.503Z	Made public.

There are currently no legacy QID mappings associated with this CVE.