Skip to content

Major scaling problems processing when dynamically yielding huge amount of tasks for parallel execution #4

@proycon

Description

@proycon

The luigi scheduler doesn't seem to cope well with a huge amount (300,000 in my test) of scheduled tasks (siblings in the dependency graph). Scheduling becomes the bottleneck and load is not distributed over available workers.

Attempting to bypass this issue by grouping tasks in batches...

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions