Remove common names that are also titles (like Prince) from stopwords #159

Open
leonhandreke wants to merge 1 commit into main from leonhandreke/remove-common-names-from-stopwords

leonhandreke (Contributor) commented Jan 20, 2026

From the magical LLM:

> Summary by frequency
> Very common surnames: King, Prince, Sultan
> Moderately common surnames: Major, Duke, Earl, Master
> Common in specific regions: Sheikh, Emir, Doctor (South Asia)
> Rare but exist: Baron, Count, General, Captain, Colonel, Lord, Lady, Reverend, President, Director, Herr, Frau

Somehow it didn't pick up on the female variants when I asked it to review all the names (in a surprising turn of events, maybe there is still a point to feminism after all!), but a second prompt yielded similar results for the female versions of these names.
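
For illustration only, here is a minimal sketch of the effect this kind of change is meant to have. It is not the code in this repository: the `TITLE_STOPWORDS` set, the `strip_stopwords` helper, and the sample name are all hypothetical, and only the surnames in `ALSO_SURNAMES` come from the list above.

```python
# Hypothetical stopword list of titles; the real list in the repository may differ.
TITLE_STOPWORDS = {"mr", "mrs", "dr", "sir", "prince", "king", "sultan", "major", "duke"}

# Titles flagged above as very or moderately common surnames.
ALSO_SURNAMES = {"king", "prince", "sultan", "major", "duke", "earl", "master"}


def strip_stopwords(name: str, stopwords: set[str]) -> str:
    """Drop every token found in the stopword set (case-insensitive, trailing dots ignored)."""
    kept = [tok for tok in name.split() if tok.lower().rstrip(".") not in stopwords]
    return " ".join(kept)


# Removing the ambiguous titles from the stopwords keeps them when they occur as names.
pruned = TITLE_STOPWORDS - ALSO_SURNAMES

before = strip_stopwords("Mr. Martin Luther King", TITLE_STOPWORDS)
after = strip_stopwords("Mr. Martin Luther King", pruned)
print(before)
print(after)
```

With the original set the surname "King" is stripped along with "Mr."; with the pruned set only "Mr." is removed. The trade-off is that a genuine title such as "King" in front of a name would then also be kept.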

leonhandreke force-pushed the leonhandreke/remove-common-names-from-stopwords branch from bc1270d to 53e69d6 on January 20, 2026 15:36
leonhandreke force-pushed the leonhandreke/remove-common-names-from-stopwords branch from 53e69d6 to 041b607 on January 20, 2026 15:37
leonhandreke requested a review from pudo on January 20, 2026 15:37