Endurance.Net Ridecamp Archives

Home Current News News Archive Shop/Advertise Ridecamp Classified Events Learn/AERC

All Endurance.Net Advertisers

Belesemo Arabians

Cypress Trails Equestrian Center,Sales, Training, Boarding

Distance Depot Tack and Equipment

DWA Arabians

Endurance Ride Photographers Guild

EuroXciser

Owyhee Endurance Rides

Parry Harness and Tack/Running Bear

PNER

Slypner Gear Trail Supplies

SWITnDR

Synergist Saddles

Tevis Cup Magic

Steph Teeter Artistry

Centropix Kloud PEMF

[Archives Index] [Date Index] [Thread Index] [Author Index] [Subject Index]

Next by date: RE: [RC] LD/50 racing vs none racing times - David LeBlanc
Previous by date: [RC] South Africa riders travelling in USA - Steph Teeter

RE: [RC] LD/50 times - David LeBlanc

Diane said:

Truman did a study on speeds and deduced from the records 
that the top LDs rode more slowly than the top 50s.  What did 
he use for this study?  The AERC ride result records.  Are 
these records accurate?

That last is the question I was addressing.  My point was 
that if there are SOME records so badly out of whack that one 
can spot them knowing nothing about the ride in question, how 
many more are equally wrong but just harder to spot?


What we have here is a common problem. Data are noisy. Any sample will have
data that's either in error, or is affected by something you may not be able
to analyze for.

So what you can do is first weed out the data points which are clearly in
error. For example, if I look at the speed distribution for LD rides for a
few hundred rides, I'll see a bell curve, but there's a spike on the end
where everyone was reported as taking 6 hours. To do things right, that
should be documented, and you then don't use those rides (or the
corresponding 50's) for the analysis. I did that.

For 
instance, according to the record, the Tidioute Challenge 25 
on 08-Jul-06 was won in 4:16.  WON!  It is not impossible 
that the winning time was so slow but when the winner of a 25 
mile ride goes only 5.86 mph, a reasonable person will think, 
"Can that be right?"  So you look at the 50 that was run 
concurrently.  That ride was won in 5:34, an average of 8.98 
mph.  That is slow too but not outlandishly so.  I think 
Truman said the 50s run about 15% faster than the LDs.  In 
this ride the record says they ran 53% faster.


This could happen. For example, my wife was in the middle of the top-10 on a
30 this year, and it wasn't because she was going fast, it was because the
really fast people just weren't in the LD on that ride. We could also have a
ride where the hardest loop is only done by the LD riders, or the hardest
loop is only done by the 50's. The expectation is that over 100's of rides,
this will average out. This is why I wanted hundreds of samples.

That's statistics for you - we see that LD vs. 50's explains some of the
variability in winning ride speeds. There's more that isn't explained -
possible factors include terrain, temperature, one ride being easier or
harder on average than the other, and who happened to show up that day. And
indeed some of the differences may be explained by ride manager error, but
from my experience, this is rare.

No matter how carefully you do a study, if the data is 
flawed, the study results are unreliable.  In the case of the 
AERC LD times, the question is not IF the data is flawed, it 
is HOW MUCH is the data is flawed.


Data are always flawed, and results of an analysis are always unreliable.
The key is to understand how data may be flawed, and how unreliable the
analysis is. In statistics, we're normally happy if we're 95% confident in
the result. There are various tests we can do to determine if 2 populations
are really different, or if it's just chance.

So for example, the math that allows me to design an aircraft wing has some
compromises (for reasons I won't go into). The net result is that the
predicted lift and other characteristics are within about 10% of actual
behavior, which is good enough to get us into the wind tunnel where we can
find out how it really works. Now that we have better computer simulations,
we can do better than that, so this is just an example.

A good indicator that my analysis is correct is that it was performed on
data collected in the late 90's. Two more recent analysis's conducted on
newer data come to the same conclusion, and believe me, Heidi isn't an
especially enthusiastic fan of LD, to put it mildly. It's pretty rare for me
to be riding LD these days, too.



=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Ridecamp is a service of Endurance Net, http://www.endurance.net.
Information, Policy, Disclaimer: http://www.endurance.net/Ridecamp
Subscribe/Unsubscribe http://www.endurance.net/ridecamp/logon.asp

Ride Long and Ride Safe!!

=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Replies
: Re: [RC] LD/50 times, Diane Trefethen

Previous by date: [RC] South Africa riders travelling in USA - Steph Teeter
Next by date: RE: [RC] LD/50 racing vs none racing times - David LeBlanc
Previous by thread: Re: [RC] LD/50 times
Next by thread: Re: [RC] LD/50 times