Skip to content

Better outlier detection procedure #2

@memoryfull

Description

@memoryfull

RFSD companion paper writes:

<...> we engaged in manual review of top-20 firms in terms of revenue or total assets within each 2-digit industry (excluding financial firms), firms with largest year-on-year changes in key financials, and firms with imputed statements and largest revenues. Our review has identified 436 firms that filed 1,130 anomalous statements in 2011–2023. The judgement was made based on the audit opinions, financials of known industry leaders, firm websites, or public information regarding the firms suspected of reporting anomalous values. We recommend RFSD users to exclude those companies from consideration and do so in our external validation.

The existing procedure to find firms with anomalous filings is manual by design. We have experimented with various rule-based procedures to detect outliers, comparing year-on-year changes in financials within firms or between-firm financials within industry. These experiments have yielded mixed results due to a large false positive rate. We would welcome any discussion of a reliable automated outlier detection procedure for the RFSD.

Background: https://t.me/rumka_ipp/792

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requesthelp wantedExtra attention is needed

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions