Information score#
The information score \(Inf(e)\) measures the degree to which the distribution of an expert is concentrated and is defined as
where \(m\) is the total number of questions and \(I_j(e)\) is given by
where \(L^* = L - k(U-L)\) and and \(U^* = U + k(U-L)\), with \(L\) and \(U\) being the lowest and highest value, respectively, obtained from the different experts and the realization if available and \(k=0.1\).
“Interpreting Information Score”
A high score of \(Inf(e)\) means that the expert distribution is concentrated in a small region. Therefore, it is considered more informative.
Let’s see it with an example#
Let’s compute the information score for expert E using the 8 seed questions in the following table.
Question |
Realization |
5% |
50% |
95% |
|---|---|---|---|---|
1 |
25 |
10 |
40 |
60 |
2 |
0.0011 |
0.002 |
0.02 |
0.05 |
3 |
214 |
5 |
80 |
150 |
4 |
10840 |
30000 |
60000 |
200000 |
5 |
1 |
5 |
13 |
30 |
6 |
6 |
2 |
15 |
50 |
7 |
20 |
30 |
45 |
60 |
8 |
0.23 |
3 |
8 |
12 |
First, we compute the \(L^*\) and \(U^*\). Note that we assume we only have one expert when computing the support, \(L^*\) and \(U^*\). If we had more than one expert, they should be considered in their computation.
\(L = [10, 0.0011, 5, 10840, 1, 2, 20, 0.23]\)
\(U = [60, 0.05, 214, 200000, 30, 50, 60, 12]\)
\(L^* = L - k(U-L) = L - 0.1 (U - L) = [5, -0.00379, -15.9, -8076, -1.9, -2.8, 16, -0.947]\)
\(U^* = U + k(U-L) = L - 0.1 (U - L) = [65, 0.0549, 234.9, 218916, 32.9, 54.8, 64, 13.18]\)
Using the computed support, we can compute \(I_j(E)\) for each question as
And similarly for the missing questions obtaining \(I_2(E)=0.055\), \(I_3(E)=0.278\), \(I_4(E)=0.323\), \(I_5(E)=0.171\), \(I_6(E)=0.124\), \(I_7(E)=0.214\), \(I_8(E)=0.205\).
Finally, we can compute \(Inf(E)\) as
Note that the Information Score is not upper bounded in 1. The value fo the information score for expert E is quite low.
It’s your turn now!#
Compute the information score assuming we have two experts A and E. You have the answers for expert A in the following table. Which expert is more informative when comparing expert A and E? Note that you need to requantify the Information Score of expert E with the new support.
Question |
Realization |
5% |
50% |
95% |
|---|---|---|---|---|
1 |
25 |
1 |
6 |
20 |
2 |
0.0011 |
\(10^{-9}\) |
\(10^{-4}\) |
0.002 |
3 |
214 |
5 |
150 |
900 |
4 |
10840 |
1000 |
10000 |
65000 |
5 |
1 |
5 |
65 |
95 |
6 |
6 |
0.1 |
6 |
50 |
7 |
20 |
2 |
40 |
99 |
8 |
0.23 |
0.1 |
5 |
40 |
Solution
\(L^* = [-4.9, -0.005, -84.5, -18900, -8.4, -4.89, -7.7, -3.89]\)
\(U^* = [65.9, 0.055, 989.5, 219900, 104.4, 54.99, 108.7, 43.99]\)
For expert A and E, respectively:
\(I_j(A)=[1.05, 3.55,0.29,1.26,0.093,0.41,0.04,0.4]\)
\(I_j(E)=[0.14, 0.067, 1.54,0.36,1.15,0.14,0.93,1.16]\)
And finally the Information Scores for each expert:
\(Inf(A) = 0.89\)
\(Inf(E) = 0.69\)
Therefore, expert A is more informative than expert E, although the difference is not too large.