Skip to content

Add sample log files for benchmark tasks#179

Merged
olearycrew merged 1 commit intomainfrom
add-log-assets
Apr 8, 2026
Merged

Add sample log files for benchmark tasks#179
olearycrew merged 1 commit intomainfrom
add-log-assets

Conversation

@ScuttleBot
Copy link
Copy Markdown

This PR adds diverse sample log files to assets/logs/ for testing log parsing and analysis tasks.

Log Files Added

File Type Source Size Lines
apache_error.log Web server error logs Loghub Apache 96KB 1,000
linux_syslog.log System logs Loghub Linux 510KB 5,000
hdfs_datanode.log Distributed file system Loghub HDFS 284KB 2,000
openssh_auth.log Authentication/Security Loghub OpenSSH 165KB 1,500
hadoop_mapreduce.log Application logs Loghub Hadoop 235KB 1,282
nginx_access_json.log JSON access logs Elastic Examples 228KB 1,000

Sources

All files are samples extracted from larger datasets to keep sizes manageable (all under 500KB). Includes README.md with detailed format descriptions and attribution.

Add diverse log samples from Loghub and Elastic:
- Apache error logs (web server)
- Linux syslog (system logs)
- HDFS DataNode logs (distributed system)
- OpenSSH auth logs (security/auth)
- Hadoop MapReduce logs (application)
- NGINX JSON access logs (structured format)

All files under 500KB for efficient testing.
Includes README with attribution and format details.
@olearycrew olearycrew merged commit 2286c45 into main Apr 8, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants