Rework PNG metadata extraction #2653

1c3t3a · 2025-11-21T06:18:04Z

This change fills gaps we observed in the current metadata extraction for PNG. There are two significant changes:

- For all metadata types, also search for metadata in the zTXt, tEXt and iTXt chunks.
- If we find metadata in a zTXt chunk, handle it like ImageMagick metadata: interpret it as a hex string prefixed by a header whose fields are separated by newlines.

We tested these methods extensively internally and ran them against all the test files we could find, including all test images in libpng and PngSuite. So far we have not found a single image with metadata in a zTXt chunk that was not stored as an ImageMagick-style hex string.
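To make the second point concrete, here is a minimal sketch of decoding such a text payload. It is not the PR's `parse_raw_profile`: the function name `decode_raw_profile` is hypothetical, and the layout (leading newline, profile name, padded decimal length, then hex digits that may be wrapped across lines) is an assumption about the ImageMagick-style format.

```rust
/// Hypothetical helper, not the function added in this PR.
/// Assumed layout: "\n<name>\n<padded decimal length>\n<hex digits, possibly wrapped>\n".
fn decode_raw_profile(text: &str) -> Option<Vec<u8>> {
    let mut parts = text.splitn(4, '\n');
    let _leading = parts.next()?; // text before the first newline (assumed empty)
    let _name = parts.next()?; // profile name, unused in this sketch
    let declared_len: usize = parts.next()?.trim().parse().ok()?;
    let body = parts.next()?;

    // Keep only the hex digits; line wrapping inside the body is ignored.
    let digits: Vec<u8> = body.bytes().filter(|b| !b.is_ascii_whitespace()).collect();

    // The declared length counts decoded bytes, so expect two digits per byte.
    if digits.len() != declared_len.checked_mul(2)? {
        return None;
    }

    let mut out = Vec::with_capacity(declared_len);
    for pair in digits.chunks(2) {
        let hi = (pair[0] as char).to_digit(16)?;
        let lo = (pair[1] as char).to_digit(16)?;
        out.push((hi * 16 + lo) as u8);
    }
    Some(out)
}
```

Returning `Option` instead of an empty `Vec<u8>` is a choice made for this sketch only; it keeps malformed input distinguishable from an empty payload.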

197g

I like it; the internal test data sounds encouraging. Regarding the test images, what is their attribution status for including them here?

197g · 2025-11-21T10:20:46Z

tests/metadata.rs

+        56, 66, 73, 77, 4, 4, 0, 0, 0, 0, 0, 49, 28, 2, 110, 0, 24, 65, 73, 45, 71, 101, 110, 101,
+        114, 97, 116, 101, 100, 32, 119, 105, 116, 104, 32, 71, 111, 111, 103, 108, 101, 28, 2, 90,
+        0, 8, 75, 105, 110, 103, 115, 116, 111, 110, 28, 2, 0, 0, 2, 0, 4, 0, 56, 66, 73, 77, 4,
+        37, 0, 0, 0, 0, 0, 16, 67, 89, 196, 70, 206, 234, 16, 4, 50, 89, 230, 125, 147, 191, 230,
+        81,


I assume this changed as a result of the hex decoding? At a glance, the original side of the test looks like mostly hex digits.
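One way to sanity-check that reading is to look at the new expected bytes as ASCII. The sketch below copies the first 41 bytes from the diff excerpt above; the helper names and the fixed offsets are assumptions that fit this particular byte sequence, not a general parser.

```rust
/// First 41 bytes of the expected value quoted in the diff excerpt.
const PREFIX: [u8; 41] = [
    56, 66, 73, 77, 4, 4, 0, 0, 0, 0, 0, 49, 28, 2, 110, 0, 24, 65, 73, 45, 71, 101, 110, 101,
    114, 97, 116, 101, 100, 32, 119, 105, 116, 104, 32, 71, 111, 111, 103, 108, 101,
];

/// Hypothetical helper: reads the first four bytes as an ASCII signature.
fn signature(bytes: &[u8]) -> Option<&str> {
    std::str::from_utf8(bytes.get(..4)?).ok()
}

/// Hypothetical helper: in this sample, bytes 15..17 hold a big-endian
/// length and the string it describes starts at byte 17.
fn first_string(bytes: &[u8]) -> Option<&str> {
    let len = u16::from_be_bytes([*bytes.get(15)?, *bytes.get(16)?]) as usize;
    std::str::from_utf8(bytes.get(17..17 + len)?).ok()
}
```

Here `signature(&PREFIX)` yields `"8BIM"` and `first_string(&PREFIX)` yields `"AI-Generated with Google"`, which is consistent with the new expectation being decoded binary data rather than hex text.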

197g · 2025-11-21T10:26:01Z

src/codecs/png.rs

+    let parse = || {
+        let mut parts = buffer.splitn(4, '\n');
+
+        // Skip the first two parts, grab the size (3rd part), then the body.


Nit: We're now dropping the contents of the second part; what would it contain? The example only says Optional, but if I understand the ImageMagick source correctly, it is any ASCII whitespace padding up to the first non-whitespace character.
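To see what is discarded, the sketch below returns the two parts that come before the size field. The function name `dropped_parts` and the sample input are assumptions for illustration; the real contents of the second part depend on what the writer emitted, which is exactly the question raised here.

```rust
/// Hypothetical helper: returns the first two newline-separated parts,
/// i.e. the ones that are skipped before the size field is read.
fn dropped_parts(text: &str) -> Option<(&str, &str)> {
    let mut parts = text.splitn(4, '\n');
    let first = parts.next()?;
    let second = parts.next()?;
    // Require the size and body parts to exist as well, so that only
    // inputs with the full four-part shape are accepted.
    parts.next()?;
    parts.next()?;
    Some((first, second))
}
```

For the assumed input `"\nexif\n       4\n3842\n494d\n"` the dropped parts are `""` and `"exif"`. Note that `splitn(4, '\n')` leaves any further newlines inside the fourth part, so a wrapped hex body stays in one piece.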

fintelia · 2025-11-21T16:32:12Z

src/codecs/png.rs

+///
+/// This method parses such data and returns the decoded payload.
+/// If the structure doesn't match, we return an empty `Vec<u8>`.
+fn parse_raw_profile(buffer: &str, header: Option<&str>) -> Vec<u8> {


Is this something that would make sense to expose as a public method from the png crate?

1c3t3a · 2025-11-22

Oh yeah, that would really make sense! It should be a method on ZTXtChunk. I will open a PR shortly to add that.

fintelia · 2025-11-21T16:34:28Z

src/codecs/png.rs

+        let size_str = parts.nth(2)?;
+        let body = parts.next()?;
+
+        // Parse size and validate a 4MB limit.


Where does this 4MB come from? Is it from the underlying standards, or something you're adding to avoid running out of memory?
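Whatever its origin, a cap like this usually exists so that a declared size from untrusted input is validated before anything is allocated. A minimal sketch, assuming the limit means 4 * 1024 * 1024 bytes; the constant and function names are hypothetical and not taken from the PR:

```rust
/// Hypothetical cap. The "4MB" figure comes from the code comment in the
/// diff; reading it as 4 * 1024 * 1024 bytes is an assumption.
const MAX_PROFILE_BYTES: usize = 4 * 1024 * 1024;

/// Parses a (possibly space-padded) decimal size and rejects values
/// above the cap, so callers can size buffers from the result safely.
fn checked_size(size_str: &str) -> Option<usize> {
    let size: usize = size_str.trim().parse().ok()?;
    (size <= MAX_PROFILE_BYTES).then_some(size)
}
```

With this shape, `checked_size("       4")` gives `Some(4)`, while a value one byte over the cap or a non-numeric field gives `None`.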

1c3t3a force-pushed the png-metadata branch 2 times, most recently from a0e290d to 0275027 · November 21, 2025 06:21

1c3t3a force-pushed the png-metadata branch from 0275027 to e49014c · November 21, 2025 06:46

197g reviewed Nov 21, 2025

fintelia reviewed Nov 21, 2025
