Skip to content

Parser suddenly quits while parsing Discogs dump #13

@aleksblendwerk

Description

@aleksblendwerk

Hi there,

as I am currenty looking to speed up my database import code for Discogs' dump files, I just tried your library with this file: https://discogs-data-dumps.s3-us-west-2.amazonaws.com/data/2024/discogs_20240701_labels.xml.gz and I might be using it wrong anyway but it also seems to stop after a couple thousand nodes.

This is more or less my code:

$stream = fopen('compress.zlib://[...]/discogs/discogs_20240701_labels.xml.gz', 'rb');

foreach (new XMLParser($stream) as $node) {
    if ($node instanceof XMLNodeContent && $node->name === 'label') {
        var_dump($node->content);
    }
}

fclose($stream);

The output ends with

string(67) "https://web.archive.org/web/20160427071301/http://www.exogenic.com/"
string(17) "Breakbeat Science"
string(17) "Breakbeat Science"

Somehow parsing suddenly ends at about 1% into the file.

I haven't investigated this further yet, will look elsewhere for now but I just thought I'd report it.

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions