I wondered just how many errors were made in spelling names in the United States census. How would someone ever figure that out, with all of the known spelling variations of surnames.
Then I thought - what about SMITH, how badly could the enumerators mess that name up? Even the indexers shouldn't be able to mess it up, right?
I used the 1920 U.S. Census for no particular reason. Here are my results:
1) SMITH spelling: 1,148,776 indexed entries (including corrections)
2) SMIHT spelling: 85 indexed entries
3) SMTIH spelling: 1,341 indexed entries
4) SIMTH spelling: 457 indexed entries
5) STMIH surname: 2 indexed entries
5) MSITH spelling: 14 indexed entries (all corrections)
6) SMTH spelling: 457 indexed entries
7) SMHIT spelling: 3 indexed entries
8) SNITH spelling: 632 indexed entries (but is SNITH a real surname?)
Assuming all of those alternates are wrong, they total to 2,991. That is .00260 of the correct SMITH name, or 0.26%. That's one in every 385 Smith listings.
The numbers don't include other legitimate alternate spellings like Smithe, Smyth, Smythe, etc.
What other surname could be used?
My view is that at least 0.2% (i out of 500) surnames in the U.S. census are misspelled by the enumerators or the indexers.
This doesn't surprise me, because I've seen SEAVER badly mangled by enumerators and indexers. By far the most obvious is the alternate name LEAVER caused by confusion of the script upper-case S for script upper-case L. But LEAVER is a legitimate surname in its own right.
The URL for this post is: http://www.geneamusings.com/2015/05/1920-us-census-errors-for-smith-surname.html
Copyright (c) 2015, Randall J. Seaver
Please comment on this post on the website by clicking the URL above and then the "Comments" link at the bottom of each post. Or contact me by email at randy.seaver@gmail.com.
Welcome to my genealogy blog. Genea-Musings features genealogy research tips and techniques, genealogy news items and commentary, genealogy humor, San Diego genealogy society news, family history research and some family history stories from the keyboard of Randy Seaver (of Chula Vista CA), who thinks that Genealogy Research Is really FUN! Copyright (c) Randall J. Seaver, 2006-2024.
2 comments:
Yikes!
I've certainly seen a lot of errors Randy. It seems like a small percentage, unless it's someone your researching and can't locate. Good post. Thanks.
Post a Comment