[3.15] gh-151497: Avoid huge pre-allocation for oversized tarfile extended headers (GH-151498) by miss-islington · Pull Request #151977 · python/cpython

miss-islington · 2026-06-23T09:44:49Z

tarfile reads a member's extended header (a GNU long name/link or a pax
header) with a single read sized by the header's size field:

buf = tarfile.fileobj.read(self._block(self.size))

The size is taken from the archive and is not validated, so a ~512-byte
crafted file can claim several gigabytes (or, via base-256 encoding, far
more) and make read() pre-allocate that much memory -- on open/iterate,
before any extraction filter runs.

Read the extended-header data in bounded chunks instead, so an oversized
or truncated header can no longer force a huge allocation. The bytes
returned for valid archives are unchanged.
(cherry picked from commit da99711)

Co-authored-by: Shardul Deshpande iamsharduld@users.noreply.github.com

Issue: tarfile: memory exhaustion via oversized extended-header (GNU long name / pax) size field #151497

…nded headers (pythonGH-151498) tarfile reads a member's extended header (a GNU long name/link or a pax header) with a single read sized by the header's size field: buf = tarfile.fileobj.read(self._block(self.size)) The size is taken from the archive and is not validated, so a ~512-byte crafted file can claim several gigabytes (or, via base-256 encoding, far more) and make read() pre-allocate that much memory -- on open/iterate, before any extraction filter runs. Read the extended-header data in bounded chunks instead, so an oversized or truncated header can no longer force a huge allocation. The bytes returned for valid archives are unchanged. (cherry picked from commit da99711) Co-authored-by: Shardul Deshpande <iamsharduld@users.noreply.github.com>

miss-islington requested a review from ethanfurman as a code owner June 23, 2026 09:44

This was referenced Jun 23, 2026

tarfile: memory exhaustion via oversized extended-header (GNU long name / pax) size field #151497

Open

gh-151497: Avoid huge pre-allocation for oversized tarfile extended headers #151498

Merged

bedevere-app Bot added the awaiting review label Jun 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[3.15] gh-151497: Avoid huge pre-allocation for oversized tarfile extended headers (GH-151498)#151977

[3.15] gh-151497: Avoid huge pre-allocation for oversized tarfile extended headers (GH-151498)#151977
miss-islington wants to merge 1 commit into
python:3.15from
miss-islington:backport-da99711-3.15

miss-islington commented Jun 23, 2026 •

edited by bedevere-app Bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

miss-islington commented Jun 23, 2026 • edited by bedevere-app Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

miss-islington commented Jun 23, 2026 •

edited by bedevere-app Bot

Loading