More Ways to Distinguish Mickey Mouse from Interesting Examples
If we do careful and correct measures of significance,
we will see night and day differences between the expected and interesting
cases. To evaluate a table from the Torah, for example: the best methods
involve some combination of:
-
mathematical calculation
-
computer comparisons of ELS's in the Torah to ELS's in
thousands of "monkey texts"
(each such text is a purposely mixed-up version of the Torah; for
example, the text has the
same words within each verse as the real Torah, but in a different
order).
-
computer comparisons of ELS's in the Torah to ELS's in
other real Hebrew texts, modern and ancient.
The tests tend to give answers
in reasonable agreement with each other.
For example, if a properly done
calculation results in a probability of 1 in 10,000;
we find that we have to search several thousand texts
to find one as good as the Torah. And we conclude that we
are looking at an interesting example.
But here, as in all of the steps, we must be careful:
we need to be able to properly evaluate whether a
text is "as good as the Torah".
We must follow the same rules as used for the Torah, but
we must not require the same exact ELS's unless those are truly the only
choices. Rather we
accept
a table of similar relevance and
rarity, arranged at least as compactly as the
table found in the Torah.
Doing this fairly, without bias, once again
requires careful analysis.
The critical step is to give each comparison
text the same level of choice as was available in the original Torah find.
And the Mickey Mouse examples often are reported with outrageous odds
because this step - if done at all - was done using completely
incorrect procedures: such as using scenario 1 for the Torah find
but forcing scenario 2 onto the comparison texts.
There can also be very subtle issues of differences between real texts and
monkey texts; or dependencies, such as between the words being
searched. But with enough repetitions and varied
types of tests, and enough varied categories of codes, these problems are
recognized and circumvented.
Back One Level
Back to Home Page