Search This Blog

Tuesday, May 8, 2012

Report on Statistical Analysis of Cricketers











Report on Statistical Analysis of Cricketers


Table of Contents




















INTRODUCTION TO SPSS





SPSS is among the most widely used programs for statistical analysis in social science and it helps organizations predict future events and proactively act upon that insight to drive better business outcomes.

We have done the statistical analysis of the cricketers for this we have chosen the four world class batsmen. These batsmen are
  • Brian Charles Lara
  • Mohammad Yousuf
  • Ricky Thomas Ponting
  • Sachin Ramesh Tendulkar

BOX PLOT


In descriptive statistics, a box plot or boxplot (also known as a box-and-whisker diagram or plot) is a convenient way of graphically depicting groups of numerical data through their five-number summaries: the smallest observation (sample minimum), lower quartile (Q1), median (Q2), upper quartile (Q3), and largest observation (sample maximum). A boxplot may also indicate which observations, if any, might be considered outliers.
Boxplots display differences between populations without making any assumptions of the underlying statistical distribution.


MEAN


In statistics, the mean is the mathematical average of a set of numbers. The average is calculated by adding up two or more scores and dividing the total by the number of scores

MEDIAN


In probability theory and statistics, a median is described as the numerical value separating the higher half of a sample, a population, or a probability distribution, from the lower half. The median of a finite list of numbers can be found by arranging all the observations from lowest value to highest value and picking the middle one. If there is an even number of observations, then there is no single middle value; the median is then usually defined to be the mean of the two middle values

MODE


In statistics, the mode is the value that occurs most frequently in a data set or a probability distribution.[1] In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score

STANDARD DEVIATION


Standard deviation is a widely used measurement of variability or diversity used in statistics and probability theory. It shows how much variation or "dispersion" there is from the average (mean, or expected value). A low standard deviation indicates that the data points tend to be very close to the mean, whereas high standard deviation indicates that the data are spread out over a large range of values.

VARIANCE


In probability theory and statistics, the variance is used as a measure of how far a set of numbers are spread out from each other. It is one of several descriptors of a probability distribution, describing how far the numbers lie from the mean (expected value). In particular, the variance is one of the moments of a distribution. In that context, it forms part of a systematic approach to distinguishing between probability distributions.

PERCENTILE


In statistics, a percentile (or centile) is the value of a variable below which a certain percent of observations fall. For example, the 20th percentile is the value (or score) below which 20 percent of the observations may be found.

SKEWNESS


In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable. The skewness value can be positive or negative, or even undefined. Qualitatively, a negative skew indicates that the tail on the left side of the probability density function is longer than the right side and the bulk of the values (possibly including the median) lie to the right of the mean. A positive skew indicates that the tail on the right side is longer than the left side and the bulk of the values lie to the left of the mean. A zero value indicates that the values are relatively evenly distributed on both sides of the mean, typically but not necessarily implying a symmetric distribution.

HISTOGRAM


In statistics, a histogram is a graphical representation, showing a visual impression of the distribution of data. It is an estimate of the probability distribution of a continuous variable and was first introduced by Karl Pearson. A histogram consists of tabular frequencies, shown as adjacent rectangles, erected over discrete intervals (bins), with an area equal to the frequency of the observations in the interval. The height of a rectangle is also equal to the frequency density of the interval, i.e., the frequency divided by the width of the interval. The total area of the histogram is equal to the number of data. A histogram may also be normalized displaying relative frequencies. It then shows the proportion of cases that fall into each of several categories, with the total area equaling 1. The categories are usually specified as consecutive, non-overlapping intervals of a variable. The categories (intervals) must be adjacent, and often are chosen to be of the same size.

RANDOM SAMPLE


In statistics, a sample is a subject chosen from a population for investigation; a random sample is one chosen by a method involving an unpredictable component. Random sampling can also refer to taking a number of independent observations from the same probability distribution, without involving any real population. The sample usually is not a representative of the population from which it was drawn— this random variation in the results is termed as sampling error. In the case of random samples, mathematical theory is available to assess the sampling error. Thus, estimates obtained from random samples can be accompanied by measures of the uncertainty associated with the estimate. This can take the form of a standard error, or if the sample is large enough for the central limit theorem to take effect, confidence intervals may be calculated.














STATISTICAL ANALYSIS:


We have done the statistical analysis of the cricketers for this we have chosen the four world class batsmen. These batsmen are
  • Brian Charles Lara
  • Mohammad Yousuf
  • Ricky Thomas Ponting
  • Sachin Ramesh Tendulkar

We have done the statistical analysis of their career records that include:
  • Batting averages
  • Highest scores
  • Strike rates
  • Centuries
  • Half centuries
  • Total runs scored
  • Total matches played
We have taken only their test matches record. And their career records are given below.

Sachin Ramesh Tendulkar:


Full name: Sachin Ramesh Tendulkar
Born: April 24, 1973, Bombay (now Mumbai), Maharashtra
Current age: 38 years 87 days
Major teams: India, Asia XI, Mumbai, Mumbai Indians,Yorkshire
Playing role: Top-order batsman
Batting style: Right-hand bat
Bowling style: Right-arm offbreak, Legbreak googly
Height: 5 ft 5 in











Batting and fielding averages



Mat
Inns
NO
Runs
HS
Ave
BF
SR
100
50
4s
6s
Ct
St
Tests
177
290
32
14692
248*
56.94


51
59

64
106
0
ODIs
453
442
41
18111
200*
45.16
20980
86.32
48
95
1981
193
136
0
T20Is
1
1
0
10
10
10.00
12
83.33
0
0
2
0
1
0
First-class
281
443
48
23611
248*
59.77


78
105


174
0
List A
540
527
55
21663
200*
45.89


59
113


171
0


Descriptive Statistics

N
Range
Minimum
Maximum
Mean

Statistic
Statistic
Statistic
Statistic
Statistic
Std. Error
Sachin
260
217.00
.00
217.00
45.8654
2.99621
Valid N (listwise)
260








Descriptive Statistics

Std. Deviation
Variance
Skewness

Statistic
Statistic
Statistic
Std. Error
Sachin
48.31246
2334.094
1.431
.151


Case Processing Summary

Cases

Valid
Missing
Total

N
Percent
N
Percent
N
Percent
Sachin
260
84.1%
49
15.9%
309
100.0%



Descriptive



Statistic
Std. Error
Sachin

Mean
45.8654
2.99621
95% Confidence Interval for Mean
Lower Bound
39.9653

Upper Bound
51.7654


5% Trimmed Mean
40.8718

Median
31.0000

Variance
2334.094

Std. Deviation
48.31246

Minimum
.00

Maximum
217.00

Range
217.00

Interquartile Range
58.75

Skewness
1.431
.151
Kurtosis
1.558
.301










Percentiles


Percentiles


5
10
25
50
75
Weighted Average(Definition 1)
Sachin
.0000
2.0000
9.0000
31.0000
67.7500
Tukey's Hinges
Sachin


9.0000
31.0000
67.5000


Percentiles


Percentiles


90
95
Weighted Average(Definition 1)
Sachin
115.8000
154.9000
Extreme Values



Case Number
Value
Sachin
Highest
1
116
217.00
2
295
214.00
3
290
203.00
4
167
193.00
5
52
179.00
Lowest
1
252
.00
2
228
.00
3
186
.00
4
182
.00
5
160
.00a
a. Only a partial list of cases with the value .00 is shown in the table of lower extremes.










Ricky Thomas Ponting:


Full name: Ricky Thomas Ponting
Born: December 19, 1974, Launceston, Tasmania
Current age: 36 years 213 days
Major teams: Australia, ICC World XI, Kolkata Knight Riders,Somerset, Tasmania
Playing role: Top-order batsman
Batting style: Right-hand bat
Bowling style: Right-arm medium
Height: 1.78 m

Batting and fielding averages



Mat
Inns
NO
Runs
HS
Ave
BF
SR
100
50
4s
6s
Ct
St
Tests
152
259
28
12363
257
53.51
20827
59.36
39
56
1406
72
178
0
ODIs
362
352
38
13406
164
42.69
16633
80.59
30
79
1195
161
155
0
T20Is
17
16
2
401
98*
28.64
302
132.78
0
2
41
11
8
0
First-class
255
436
55
21332
257
55.98


73
94


270
0
List A
434
424
51
15762
164
42.25


34
94


187
0
Twenty20
22
21
2
460
98*
24.21
375
122.66
0
2
44
13
10
0


Descriptive Statistics

N
Range
Minimum
Maximum
Mean

Statistic
Statistic
Statistic
Statistic
Statistic
Std. Error
Ponting
152
298.00
.00
298.00
81.3355
5.47769
Valid N (list wise)
152








Descriptive Statistics

Std. Deviation
Variance
Skewness

Statistic
Statistic
Statistic
Std. Error
Ponting
67.53345
4560.767
1.141
.197


Case Processing Summary

Cases

Valid
Missing
Total

N
Percent
N
Percent
N
Percent
Ponting
152
49.2%
157
50.8%
309
100.0%


Descriptives



Statistic
Std. Error
Ponting

Mean
81.3355
5.47769
95% Confidence Interval for Mean
Lower Bound
70.5127

Upper Bound
92.1583


5% Trimmed Mean
75.7208

Median
61.5000

Variance
4560.767

Std. Deviation
67.53345

Minimum
.00

Maximum
298.00

Range
298.00

Interquartile Range
94.50

Skewness
1.141
.197
Kurtosis
.847
.391


Percentiles


Percentiles


5
10
25
50
75
Weighted Average(Definition 1)
Ponting
6.6500
11.0000
28.5000
61.5000
123.0000
Tukey's Hinges
Ponting


29.0000
61.5000
123.0000



Extreme Values



Case Number
Value
Ponting
Highest
1
142
298.00
2
74
288.00
3
100
263.00
4
106
256.00
5
95
253.00
Lowest
1
40
.00
2
30
.00
3
29
.00
4
26
1.00
5
12
4.00


















Mohammad Yousuf:



Full name: Mohammad Yousuf
Born: August 27, 1974, Lahore, Punjab
Current age: 36 years 327 days
Major teams: Pakistan, Asia XI, Bahawalpur, Lahore,Lahore Badshahs, Lancashire, Pakistan International Airlines,Warwickshire, Water and Power Development Authority,Zarai Taraqiati Bank Limited
Batting style: Right-hand bat
Bowling style: Right-arm offbreak

Batting and fielding averages



Mat
Inns
NO
Runs
HS
Ave
BF
SR
100
50
4s
6s
Ct
St
Tests
90
156
12
7530
223
52.29
14372
52.39
24
33
957
51
65
0
ODIs
288
273
40
9720
141*
41.71
12942
75.10
15
64
785
90
58
0
T20Is
3
3
0
50
26
16.66
43
116.27
0
0
5
1
1
0
First-class
141
239
20
10505
223
47.96


30
51


84
0
List A
338
322
47
11026
141*
40.09


15
75


70
0
Twenty20
23
20
2
357
57*
19.83
322
110.86
0
1
37
8
9
0


Descriptive Statistics

N
Range
Minimum
Maximum
Mean

Statistic
Statistic
Statistic
Statistic
Statistic
Std. Error
Yousuf
89
250.00
.00
250.00
84.6067
6.56364
Valid N (listwise)
89









Descriptive Statistics

Std. Deviation
Variance
Skewness

Statistic
Statistic
Statistic
Std. Error
Yousuf
61.92125
3834.241
.862
.255

Case Processing Summary

Cases

Valid
Missing
Total

N
Percent
N
Percent
N
Percent
Yousuf
89
28.8%
220
71.2%
309
100.0%


Descriptive



Statistic
Std. Error
Yousuf

Mean
84.6067
6.56364
95% Confidence Interval for Mean
Lower Bound
71.5629

Upper Bound
97.6506


5% Trimmed Mean
80.7828

Median
72.0000

Variance
3834.241

Std. Deviation
61.92125

Minimum
.00

Maximum
250.00

Range
250.00

Interquartile Range
91.00

Skewness
.862
.255
Kurtosis
.047
.506




Percentiles


Percentiles


5
10
25
50
75
Weighted Average(Definition 1)
Yousuf
9.0000
16.0000
33.0000
72.0000
124.0000
Tukey's Hinges
Yousuf


34.0000
72.0000
124.0000




Percentiles


Percentiles


90
95
Weighted Average(Definition 1)
Yousuf
191.0000
213.5000


Extreme Values



Case Number
Value
Yousuf
Highest
1
67
250.00
2
72
247.00
3
73
226.00
4
62
223.00
5
35
204.00
Lowest
1
19
.00
2
9
3.00
3
1
6.00
4
46
8.00
5
90
10.00












0 comments:

Twitter Delicious Facebook Digg Stumbleupon Favorites More