Roll & Rock

[ Roll & Rock ] [ Main Menu ]


  


21159


Date: March 09, 2020 at 13:26:33
From: Skywise, [DNS_Address]
Subject: Roger, question re Jones p for fractional windows


The Jones p is computed from how many consecutive
windows have hits divided by how many of those
windows there are. Easy when, say, it's a 5 day window
tested against 500 days of record - 100 windows.

But say it was a 6 day window? That's 83 1/3
consecutive windows. What's the math for that? I'm
trying different ideas but I'm getting non-sensical
results.

Brian


Responses:
[21165] [21166] [21167] [21168] [21169] [21170] [21172] [21173] [21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21165


Date: March 09, 2020 at 16:34:48
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Brian;

You just have to adjust the test range until it comes
out even.

If it's long enough it shouldn't matter anyway.

Roger


Responses:
[21166] [21167] [21168] [21169] [21170] [21172] [21173] [21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21166


Date: March 09, 2020 at 19:45:29
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


That's what I thought, as I couldn't come up with any sort of weighting factor for the partial window that would come up with the right answer.

I'm still messing around with small samples, and in this case, situations where I already know what the answer is.

BTW, in such a case, my overlapping windows method works and comes up with the right probability. But obviously much more work. I agree it's easier to simply adjust the search record to fit even multiples of the prediction window and use the consecutive window method.

Brian


Responses:
[21167] [21168] [21169] [21170] [21172] [21173] [21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21167


Date: March 09, 2020 at 20:14:15
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Skywise;

Remember too, the question is how many times in
history has this prediction been true. If you mess
with the window, shifting it a day at a time,
eventually it will miss the quake.

Roger


Responses:
[21168] [21169] [21170] [21172] [21173] [21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21168


Date: March 09, 2020 at 20:47:14
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


I've tested that empirically. (9 different scenarios so far)

My overlapping windows gets the same result as Jones'
consecutive windows *IF* the average of all distributions
is taken.

THAT is the real argument I have. Taking only a single
distribution does not result in an accurate probability.

I'm working on putting together the proof.

Brian


Responses:
[21169] [21170] [21172] [21173] [21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21169


Date: March 09, 2020 at 21:06:27
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Skywise;

How are you determining the correct probability?

Roger


Responses:
[21170] [21172] [21173] [21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21170


Date: March 09, 2020 at 22:33:25
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


In attempting to write a quick summary example for you,
I think I made have made an error somewhere.

This is good. It means I'm right to not be too sure of
myself just yet. Hence why I keep trying different
scenarios.

The whole point is, I want to make sure I truly understand
the math involved, rather than just using it blindly.

Brian


Responses:
[21172] [21173] [21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21172


Date: March 10, 2020 at 07:04:19
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Brian;

I asked Dr Jones if he had a paper describing it but
he does not.

Probability is the number of times a thing happens
divided by the number of times it could have
happened.

The "thing" here is a quake of a certain size and
location within a specific 10 day window. By
stepping thru windows a day at a time you're
involving different, unspecified windows.

Roger


Responses:
[21173] [21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21173


Date: March 10, 2020 at 07:10:54
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Brian;

What I don't understand is that the number of quakes
in a predicted window does not matter.

It seems to me, if window A has 100 quakes while
window B has only 1, the odds are different.

Dr Jones says no.

Roger


Responses:
[21174] [21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21174


Date: March 10, 2020 at 07:53:57
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


And I think you ARE right.

What got me started was I was just following a curiosity
hunch.

Using some samples from one of my tests...

A prediction has a 4 day window.
The 'historical record' is 12 days long.
This means there are 12/4 = 3 consecutive windows.

That means there are 3 test windows that can have a hit.

If for example we salt the 12 day 'historical record'
with 3 days of quakes, there are 220 possible 12 day
patterns that will have quakes on 3 days.

So let's look at the Jones p for some of those 12 day
patterns.

HHHm mmmm mmmm = 0.333
HmmH mmHm mmmm = 0.667
Hmmm mmHm mHmm = 1.000

As you can see, the Jones p changes wildly depending on
the exact distribution of the quakes in the record.

Given we salted the record to have 3 days out of 12
with quakes, the odds SHOULD be 3/12 or 0.250.

What I just realized when I was trying to give an example
before was that the average p method I was using was
NOT giving the expected odds of 0.250. So I realized I
was on the wrong track or missed something. I think I'm
still right that JUST using the p value of the given
historical record will not be correct. It may by chance
be close, but it may still be way off depending on how
the quakes cluster.

I'm now looking at a new idea.

Brian


Responses:
[21175] [21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21175


Date: March 10, 2020 at 08:09:19
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Skywise;

The historical record is all we have so it better be
right!

But the probability can change over time as new areas
become active or active areas die down.

An exact answer isn't possible.

Roger


Responses:
[21176] [21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21176


Date: March 10, 2020 at 08:34:55
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Yes, the historical record is all we have for a sequence
of events with truly unknown probability.

We may not be able to get an exact answer, but we should
always strive to be as accurate as possible. We should
always question ourselves and our methods. Basic science
methodology.

Even if I'm going down a wrong path, at least I myself
will have a better understanding of the problem.

Brian


Responses:
[21177] [21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21177


Date: March 10, 2020 at 09:10:07
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Occam's Razor. The simplest answer is most likely the
right one.

Using my same example from before of a 12 day 'historical
record' with 3 days having quakes - well, the simple
answer is 3/12 = .25, which is the expected answer.

The probability is simply the number of days with quakes
divided by the total number of days (or whatever the time
resolution is). No need for complicating things with
consecutive or overlapping windows. This also declusters
the catalog as it doesn't matter what the distribution is.
They could all happen in a clump or be spread evenly
across the record.

Which makes me ask, what is the logic behind the
consecutive window thing?

Brian


Responses:
[21180] [21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21180


Date: March 10, 2020 at 10:23:05
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Skywise;

The consecutive windows are all the possible windows.

If they overlapped you'd have too many possible
windows.

The search advances by years. For a given prediction
there is only one correct answer; the window contains
1 or more quakes or it does not. The "or more" cancels
clusters.

Roger


Responses:
[21181] [21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21181


Date: March 10, 2020 at 11:17:08
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


That's not the question.

Why use the consecutive windows thing at all? What is
the justification or logic behind it?

If a given quake history of 1000 days has 137 quake days,
why can't we just say the probability is .137? Simple math.
Number of days with quakes divided by total number of days.

In all the scenarios I've looked at so far, rarely does
the Jones p equal the actual real known p. And I know
the real p because I created the history to reflect
a chosen p. In fact, I looked at all possible variations
of the history that have that chosen p.

Again, my example of a 12 day 'history'. I chose a p of
.250, which means there are 3 of those 12 days with a
quake. Simplicity says 3/12, p = .250.

If the prediction window is 4 days, that means there
are 3 consecutive Jones windows. So that means the only
possible Jones probabilities are 0.000, 0.333, 0.666,
and 1.000. NEVER 0.250 which is the known actual value
of p.

Yes, in a large enough 'history' the Jones p _could_
approach the real p. But it will never equal it except
in certain circumstances, specifically...

If by chance the prediction window is some integer
multiple of the number of days with a quake in the
history, then yes it's possible to have a Jones p the
same as the actual p, but it's still not guaranteed as
it still depends on the distribution of the quake days,
again because the Jones p changes depending on the
actual distribution of those quake days in the history.

For example, if in the 12 day 'history' with a chosen
p of .250, thus 3 of 12 days with a quake, and we instead
chose a 3 day prediction window, this means there are
now 4 Jones windows, and the possible Jones p values
are 0.000, 0.250, 0.500, 0.750, and 1.000. So if by
chance the 3 quake days are distributed so that one and
only one Jones window has a hit, then yes the Jones p
will be the same as the real known p. But given there
are 220 possible history patterns of 3 quake days in
this 12 day 'history', and only 4 of them will have all
3 quake days in one Jones window, then the odds (pardon
the pun) are very much against the Jones p equaling the
real known value of p.

Brian


Responses:
[21182] [21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21182


Date: March 10, 2020 at 12:42:07
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Skywise;

While posting with Dr Jones the light finally came
on in my head!

We take the entire study period and divide it up
into consecutive windows of the predicted length.

Then we look at the quakes for the study period. If
a quake of the right location and mag happens we
drop it into the appropriate time window.

Repeat for all quakes.

Then we look at the windows and count how many have
quakes. That divided by total windows is the odds on
a selected window containing an appropriate quake.

Your sliding windows gives multiple hits.

Roger


Responses:
[21185] [21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21185


Date: March 10, 2020 at 13:52:00
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Forget the sliding windows. I am NOT doing that any more.
I agree that it is not entirely correct, and even if it
were, it's unnecessary work.

I am doing the Jones consecutive windows.

And again, WHY do we do consecutive windows in the first
place? THAT is my question. I don't see the reason for
it. Using it I am getting unexpected answers.

Let me try to illustrate using the limits of a text only
message.

I am making a prediction with a 4 day window.

We are testing against a catalog with 12 days in it.

So this means there are 3 consecutive 4 day Jones
Windows.

We good so far?

Now let's say the 12 day catalog is of the following
pattern for quakes, where 'o' is no quake and 'X' is a
quake for that day.

oXoX oooo oXoo

I've already divided it up into those 3 consecutive
Jones windows. As can plainly be seen, there are two
Jones windows with a hit, and one without. That's 2
out of 3. So the Jones probability is .667.

Right?

OK.

However, shouldn't the probability actually be 3/12 = .250?
After all, there are 3 quake days across 12 days. 3
divided by 12 equals .250.

Let me illustrate why.

For 12 days with exactly 3 of them as quake days, there
are 220 possible combinations. I have written out all
220 combinations in a spread sheet so I can figure out
the Jones p for each one.

Here's a few examples:

XXXo oooo oooo 0.333
oXoo XoXo oooo 0.667
ooXo oooX Xooo 0.667
oooo ooXX Xooo 0.667
oooo XoXX oooo 0.333
Xooo ooXo oooX 1.000
oooX oooX Xooo 1.000


Since there are 3 Jones windows, that means there are
2^3-1 possible combinations of Jones windows with at
least one hit, where m = miss and H = hit, and
following each is the resulting Jones p.

mmH = 0.333
mHm = 0.333
mHH = 0.667
Hmm = 0.333
HmH = 0.667
HHm = 0.667
HHH = 1.000

(0.000, all three misses is not included because that
means there are no quakes, and we are looking only at
possibilites that have exactly 3 quakes)

Still following me?

So there are 3 possible Jones p results. Listed below
are those three results followed by how many of those
220 possible combinations mentioned above result in
that particular Jones p value.

0.333 = 12
0.667 = 144
1.000 = 64

As you can plainly see, depending on which of the 220
possible distributions of exactly 3 quake days in a 12
day catalog is evaluated, there could be any one of
three possible Jones p values.

Now remember, we are looking at 3 quake days out of 12
days. Does that not mean the probability is 3/12 or
0.250? It doesn't matter the distribution of those three
quake days. It could be any one of those 220 possible
variations. They all have 3 out of 12.

If so, in this particular case, the Jones p value is
NEVER right.

Yes, in reality, there is only one quake history. But
if a real 12 day catalog only contains 3 quake days,
what is the correct p value?

One of the three possible Jones p values which depend
on the particular distribution of the quakes?

Or simply 3/12 = 0.250?

Brian


Responses:
[21187] [21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21187


Date: March 10, 2020 at 14:14:39
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Skywise;

It all depends on how they happen.

The windows are consecutive so that all days are
covered.

Days don't count; windows count. There are 3 in your
example.

If all quakes fall in 1 window odds are 1/3

all in 2 days = 2/3

all in 3 windows, 3/3

Those are the only possibilities.

Roger


Responses:
[21188] [21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21188


Date: March 10, 2020 at 15:10:59
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Yet, we know the real odds are 1/4.

Not 1/3. Not 2/3. Not 3/3.

Brian


Responses:
[21191] [21193] [21194] [21195] [21196] [21197] [21200] [21203]


21191


Date: March 10, 2020 at 18:24:57
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Wrong.

It's window probability, not day probability.

Roger


Responses:
[21193] [21194] [21195] [21196] [21197] [21200] [21203]


21193


Date: March 10, 2020 at 20:06:56
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


I've written code to simulate large scenarios. I can
adjust record size, prediction window size, and
predetermined probability at will.

When using record sizes of thousands, if not tens of
thousands, yeah, I sorta see it. I just don't quite
understand it.

Statistics is not intuitive nor easy.

Brian


Responses:
[21194] [21195] [21196] [21197] [21200] [21203]


21194


Date: March 10, 2020 at 20:21:06
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Brian;

Look at it this way; you want the probability of what
is predicted, not something else similar.

So it's THIS location +/- 3 degrees, not something
+2/-4 degrees.

Roger


Responses:
[21195] [21196] [21197] [21200] [21203]


21195


Date: March 10, 2020 at 20:53:31
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


"Look at it this way; you want the probability of
what is predicted, not something else similar."


Granted. But how do we know the probability?

I don't think you are quite understanding my question.

WHY is the method Jones uses the correct one? That is
what I'm struggling with. It doesn't make sense that
the distribution of the quakes can change the
probability.

If I have a 10,000 day catalog and test one of Shan's
20 day windows, that's 500 consecutive windows.

What if I change the start day of my analysis? Guess
what? The Jones probability changes because I've just
shifted the distribution of the quakes.

Do you not see this?

Hell, try it yourself. You say you use the USGS catalog
from 1974. Try shifting the start date by a day at a
time and watch what happens to the resulting
probability for a given prediction.

Brian


Responses:
[21196] [21197] [21200] [21203]


21196


Date: March 11, 2020 at 08:33:25
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Skywise;

Of course; the probability depends on the dataset.

So is doing it your way.

You expect absolute truth?

That's why we use a very large historical record, to
get the best approximation possible within reason.

Roger


Responses:
[21197] [21200] [21203]


21197


Date: March 11, 2020 at 08:37:34
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


PS: Have you checked your answer using a large dataset
compared to Jones for the same dataset?

They should be very close.

Roger


Responses:
[21200] [21203]


21200


Date: March 11, 2020 at 13:49:08
From: Skywise, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


I've been a silly boy.... my 'expected' probability was
all wrong. I was looking at it the wrong way. My mistake
is that although I may be salting the test record with
a 10% probability, I failed to note that that is the
DAILY probability. But what is the probability across
the length of the prediction window? ooooops.

This just goes to show that statistics is a tricky bugger.
And it's easy to make simple mistakes.

I've got the right formula now, and the Jones p does come
very close to this number.

I just did a VERY large test run now to see how well it
converges. I averaged 1,000 runs of 100,000 day records
with exactly 10% random probability analyzing a 20 day
window.

The expected probability was 0.878423.

min Jones p: 0.868000
avg Jones p: 0.878562
max Jones p: 0.889400

As you can see, it's only off by about 0.016% on
average, and at most about 1.25%.

Brian


Responses:
[21203]


21203


Date: March 11, 2020 at 15:41:34
From: Roger Hunter, [DNS_Address]
Subject: Re: Roger, question re Jones p for fractional windows


Skywise;

Good enough for government work as we used to say when
I was a government worker.

Roger


Responses:
None


[ Roll & Rock ] [ Main Menu ]

Generated by: TalkRec 1.17
    Last Updated: 30-Aug-2013 14:32:46, 80837 Bytes
    Author: Brian Steele