Варіант статистичних змін у двох відбіркових форматах Формули 1

15

Я щойно прочитав цю статтю BBC про відбірковий формат у Формулі 1.

Організатори бажають зробити класифікацію менш передбачуваною, тобто збільшити статистичну різницю в результаті. Промальовуючи декілька несуттєвих деталей, на даний момент водії займають рейтинг у своєму кращому одиночному колі з (за конкретністю) двох спроб.

Один керівник F1, Жан Тодт, запропонував, щоб класифікація водіїв в середньому за два кола збільшила статистичну різницю, оскільки водії можуть вдвічі більше помилитися. Інші джерела стверджували, що будь-яке усереднення, безумовно, зменшить статистичну різницю.

Чи можемо ми сказати, хто має рацію під розумними припущеннями? Я припускаю, що вона зводиться до відносної дисперсії $\text{mean}(x,y)$ проти , де і - випадкові величини, що представляють два пробігу водія? $\text{min}(x,y)$ $x$ $y$

variance

— безтурботний
джерело

5

Я думаю, що це залежить від розподілу часу на колінах.

Нехай незалежні, однаково розподілені. $X,Y$

Якщо , то $P(X=0)=P(X=1)=\frac{1}{2}$ $Var(\frac{X+Y}{2}) = \frac{1}{8} < Var( \min(X,Y)) = \frac{3}{16}.$
Якщо ж , то $P(X=0) = 0.9, P(X=100)=0.1$ $Var(\frac{X+Y}{2}) = 450 > Var( \min(X,Y)) = 99.$

Це узгоджується з аргументом, згаданим у питанні про помилку (тобто, винятково тривалий час з невеликою ймовірністю). Таким чином, нам слід було б знати розподіл часу в колі, щоб вирішити.

— сандріс
джерело

Цікаво, я думаю, щось подібне працює і для безперервних оборотів. Що саме пішло не так у попередньому доказі?

— innisfree

1

Наскільки я розумію, він стверджує, що при даному

відстань між

і середнім завжди менше відстані між

та

, тому дисперсія середнього має бути меншою, ніж дисперсія

. Це, однак, не випливає:

x \geq y

$x\geq y$

x

$x$

x

$x$

min (x, y)

$\min(x,y)$

min (x, y)

$\min(x,y)$

min (x, y)

$\min(x,y)$ може залишатися незмінно далеко, тоді як середня величина дуже варіюється. Якби доказ базувався на фактичному розрахунку, було б простіше точно вказати місце, де воно піде не так (або перевірити, чи він дійсний).

— Сандріс

2

Не втрачаючи загальності, припустимо, що і що обидві змінні черпають з одного розподілу з певним середнім і дисперсією. $y \leq x$

покращується на на, $\{y,x\}$ $\{x\}$

випадок 1, означає: , $\frac{y-x}{2}$

випадок 2, хв: . $y-x$

Тому середня величина вдвічі впливає на поліпшення (яке визначається дисперсією), ніж мінімальне (для 2 випробувань). Тобто середнє гасить мінливість.

— Джеймс
джерело

Я не переконаний, що це цілком правильно, ви можете надати офіційне пояснення?

— сандріс

2

Ось мій доказ Var [середній]

Для 2 випадкових величин x, y існує співвідношення між їх середнім значенням, max та min.

2 M e a n (x, y) = M i n (x, y) + M a x (x, y)

$2\,Mean(x,y) = Min(x,y) + Max(x,y)$

4 V a r [M e a n] = V a r [M i n] + V a r [M a x] + 2 C o v [M i n, M a x]

$4\,Var[Mean] = Var[Min]+Var[Max]+2\,Cov[Min,Max]$ If we now assume that the distribution is symmetric around the mean then

V a r [M i n (x, y)] = V a r [M a x (x, y)]

$Var[Min(x,y)]=Var[Max(x,y)]$ Then

4 V a r [M e a n] = 2 V a r [M i n] + 2 C o v [M i n, M a x]

$4\,Var[Mean] = 2\,Var[Min] + 2\,Cov[Min,Max]$ and

C o v [M i n, M a x] <= s q r t (V a r [M i n] V a r [M a x]) = V a r [M i n]

$Cov[Min,Max]<=sqrt(Var[Min]Var[Max])=Var[Min]$ Therefore

V a r [M e a n] <= V a r [M i n]

$Var[Mean]<=Var[Min]$ It is also easy to see from this derivation that in order to reverse this inequality you need a distribution with very sharp truncation of the distribution on the negative side of the mean. For example for the exponential distribution the mean has a larger variance than the min.

— sega_sai
джерело

1

Nice question, thank you! I agree with @sandris that distribution of lap times matters, but would like to emphasize that causal aspects of the question need to be addressed. My guess is that F1 wants to avoid a boring situation where the same team or driver dominates the sport year after year, and that they especially hope to introduce the (revenue-generating!) excitement of a real possibility that 'hot' new drivers can suddenly arise in the sport.

That is, my guess is that there is some hope to disrupt excessively stable rankings of teams/drivers. (Consider the analogy with raising the temperature in simulated annealing.) The question then becomes, what are the causal factors at work, and how are they distributed across the population of drivers/teams so as to create persistent advantage for current incumbents. (Consider the analogous question of levying high inheritance taxes to 'level the playing field' in society at large.)

Suppose incumbent teams are maintaining incumbency by a conservative strategy heavily dependent on driver experience, that emphasizes low variance in lap times at the expense of mean lap time. Suppose that by contrast the up-and-coming teams with (say) younger drivers, necessarily adopt a more aggressive (high-risk) strategy with larger variance, but that this involves some spectacular driving that sometimes 'hits it just right' and achieves a stunning lap time. Abstracting away from safety concerns, F1 would clearly like to see some such 'underdogs' in the race. In this causal scenario, it would seem that a best-of-n-laps policy (large $n$ ) would help give the upstarts a boost -- assuming that the experienced drivers are 'set in their ways', and so couldn't readily adapt their style to the new policy.

Suppose, on the other hand, that engine failure is an uncontrollable event with the same probability across all teams, and that the current rankings correctly reflect genuine gradation in driver/team quality across many other factors. In this case, the bad luck of an engine failure promises to be the lone 'leveling factor' that F1 could exploit to achieve greater equality of opportunity--at least without heavy-handed ranking manipulations that destroy the appearance of 'competition'. In this case, a policy that heavily penalizes engine failures (which are the only factor in this scenario not operating relatively in favor of the incumbents) promises to promote instability in rankings. In this case, the best-of-n policy mentioned above would be exactly the wrong policy to pursue.

— David C. Norris
джерело

0

I generally agree with other answers that the average of two runs will have a lower variance, but I believe they are leaving out important aspects underlying the problem. A lot has to do with how drivers react to the rules and their strategies for qualifying.

For instance, with only one lap to qualify, drivers would be more conservative, and therefore more predictable and more boring to watch. The idea with two laps is to allow the drivers to take chances on one to try to get that "perfect lap", with another available for a conservative run. More runs would use up a lot of time, which could also be boring. The current setup might just be the "sweet spot" to get the most action in the shortest time frame.

Also note that with an averaging approach, the driver needs to find the fastest repeatable lap time. With the min approach, the driver needs to drive as fast as possible for only one lap, probably pushing further than they would under the averaging approach.

Це обговорення ближче до теорії ігор. Ваше запитання може отримати кращі відповіді, якщо їх поставити в такому світлі. Тоді можна запропонувати інші прийоми, як, наприклад, можливість водієві скинути час першого кола на користь другого пробігу та, можливо, більш швидкого або повільного часу. І т.д.

Також зауважте, що цього року було зроблено спробу зміни в кваліфікації, яка, як правило, штовхала водіїв в один консервативний круг. https://en.wikipedia.org/wiki/2016_Formula_One_season#Qualifying Результат розглядався як катастрофа і швидко скасовувався.

— Мадденкер
джерело