Kurskod: TAMS11 Provkod: TENB 12 January 2015, 08:00-12:00. English Version

Kurskod: TAMS Provkod: TENB 2 January 205, 08:00-2:00 Examiner: Xiangfeng Yang (Tel: 070 2234765). Please answer in ENGLISH if you can. a. You are allowed to use: a calculator; formel -och tabellsamling i matematisk statistik (from MAI); TAMS : Notations and Formulae (by Xiangfeng Yang), a dictionary. b. Scores rating: 8- points giving rate 3;.5-4.5 points giving rate 4; 5-8 points giving rate 5. (3 points) English Version Two random variables X and Y have a joint probability mass function as follows X\Y 0 00 200 00 0.20 0.0 0.20 250 0.05 0.5 0.30 The table tells that X can take values 00 and 250, and Y can take values 0, 00 and 200. (.). (p) Find P (X = 250) and P (Y = 200). (.2). (p) Find the expected value µ = E(Y ) of the random variable Y. (.3). (p) Are X and Y independent? Why? Solution. (.). P (X = 250) = 0.05 + 0.5 + 0.3 = 0.5, and P (Y = 200) = 0.2 + 0.3 = 0.5. (.2). The expected value µ = E(Y ) = 0 0.25 + 00 0.25 + 200 0.5 = 25. (.3). Form (.) we know that P (X = 250) = 0.5 and P (Y = 200) = 0.5, thus Therefore, X and Y are NOT independent. 0.3 = P (X = 250, Y = 200) P (X = 250) P (Y = 200) = 0.25. 2 (3 points) Suppose that the mass X of a certain particle has a probability density function f(x) = x 0, 0 x 70. c (2.). (p) Find the value of the constant c =? (2.2). (p) Find the expected value µ = E(X) of X. (2.3). (p) Find the conditional probability P (X > 30 X < 50). Solution. (2.). From the definition of a probability density function we know that (2.2). = µ = E(X) = f(x)dx = 70 0 xf(x)dx = x 0 dx = 800, thus c = 800. c c 70 0 x 2 0x dx = 90000 800 800 = 50.dx Page /3

(2.3). P (X > 30 X < 50) = P (30 < X < 50) P (X < 50) = 50 x 0 30 800 dx 50 x 0 0 800 600/800 = = 3/4 = 0.75. dx 800/800 3 (3 points) There are 40 students in an elementary statistics class. On the basis of years of experience, the instructor knows that the time needed to grade a randomly chosen first examination paper is a random variable with an expected value of 6 min and a standard deviation of 6 min. If grading times are independent and the instructor begins grading at 6:50 P.M. and grades continuously, what is the probability that he is through grading before the :00 P.M. TV news begins? (Hint: apply Central Limit Theorem). Solution. Let X j = grading time for the j-th examination, j =,..., 40. From the conditions we know that X,..., X 40 are independent and all have a mean µ = E(X j ) = 6 and a standard deviation σ = 6. We find the probability as follows P (he is through grading before the :00 P.M) = P (X +... + X 40 250 minutes) = P ( X +... + X 40 40 = P ( X 6 6/ 40 25/4 6 6/ 40 ) = P (N(0, ) 0.26) = 0.6026. 250 40 ) = P ( X 25/4) 4 (3 points) Suppose that a population X is a continuous random variable having a probability density function f(x) = θ +, 0 x θ +, with an unknown parameter θ > 0. There is a sample {x,..., x n } from this population. (4.). (p) Find a point estimate ˆθ MM of θ using Method of Moments. (4.2). (2p) Find a point estimate ˆθ ML of θ using Maximum-Likelihood method. Solution. (4.). For Method of Moments, the first equation is E(X) = x. The mean E(X) can be calculated as E(X) = xf(x) = θ+ 0 x θ + dx = (θ + ). 2 By solving E(X) = x, we have θ = 2 x which yields ˆθ MM = 2 x. (4.2). For the Maximum-Likelihood method, we write the likelihood function as L(θ) = f(x ) f(x 2 )... f(x n ) = Maximizing L(θ) is equivalent to maximize ln L(θ) where d ln L(θ) (θ + ) n, for x θ +,..., x n θ +. ln L(θ) = n ln(θ + ). By looking at the first derivative dθ = n/(θ + ) < 0, the function ln L(θ) is decreasing. This means that the least value of θ will give a maximal value of the function ln L(θ). But what is the lease value of θ? In L(θ) we had x θ +,..., x n θ +, thus θ max {x,..., x n }. Therefore (no need to check the second derivative of ln L(θ).) ˆθ ML = max {x,..., x n }. Page 2/3

5 (3 points) measurements of the same item have resulted in the following values: 5.4, 3.76, 5.09, 5.87, 6.33, 4.03, 6.25, 5.57, 3.28, 5.2, 5.66, 5.0, 4.63, 5.74, 4.20, 4.69. The average of the data is x = 5.03, and the standard deviation of the data is s = 0.885. Assume that the observations are independent and from a population N(µ, σ 2 ). (5.). (p) If σ is unknown, find a 95% confidence interval of µ. (5.2). (p) If σ = is known, find a 95% confidence interval of µ. (5.3). (p) If σ is unknown, find a 95% confidence interval of σ. Solution. (5.). Since σ is unknown, a 95% confidence interval of µ would be I µ = x t α/2 (n ) s n = 5.03 t 0.025 ( ) 0.885 = 5.03 2.3 0.2225 = 5.03 0.47 = (4.56, 5.5). (5.2). Since σ is known σ =,, a 95% confidence interval of µ would be I µ = x z α/2 σ n = 5.03 z 0.025 = 5.03.96 (5.3). A 95% confidence interval of σ 2 would be ( ) (n )s 2 (n )s 2 ( ( )0.885 2 I σ 2 = χ 2 α/2 (n ), = (n ) χ 2 0.025 ( ), ( )0.885 2 ) χ 2 = 0.975 ( ) χ 2 α/2 = 5.03 0.49 = (4.54, 5.52). ( ).75 27.5,.75 = (0.428,.877). 6.26 Thus a 95% confidence interval of σ is I σ = ( 0.428,.877) = (0.654,.37). 6 (3 points) Extensive experience shows that if a certain disease is treated in the traditional way, then the probability p that the patient recovers is only 0.6. Now there is a new medicine against the disease. In a treatment experiment, there were 68 recovered out of 00 patients using this new medicine. (6.). (p) Test the following hypotheses with a significance level α = 0.05 : H 0 : p = 0.6 versus H a : p > 0.6. (6.2). (2p) For the test in (6.), what is the probability of not concluding that p > 0.6 when the actual p = 0.8? Solution. (6.). T S = ˆp p 0 = 68/00 0.6 = 0.08/0.049 =.63, and the rejection region p0( p 0)/n 0.6( 0.6)/00 C = (z α, ) = (.645, ). Since T S / C, we do NOT reject H 0. (6.2). This is a Type II error, namely β(0.8) = P (don t reject H 0 when H 0 is wrong and p = 0.8) ˆp p 0 = P ( <.645 when p = 0.8) p0 ( p 0 )/n ˆp p 0 (need to change p0 ( p 0 )/n to ˆp p ˆp p since N(0, )) p( p)/n p( p)/n = P ( ˆp p + p p 0 p( p)/n ˆp p = P ( <.645 p( p)/n p( p) <.645 when p = 0.8) p 0 ( p 0 ) p 0 ( p 0 ) p( p) p p 0 p( p)/n when p = 0.8) = P (Z < 2.985) = 0.9986 = 0.004. Page 3/3

Kurskod: TAMS Provkod: TENB 2 januari 205, kl. 8-2 Examinator: Xiangfeng Yang (Tel: 070 2234765). Vänligen svara på ENGELSKA om du kan. a. Tillåtna hjälpmedel är: en räknare; formel -och tabellsamling i matematisk statistik (från MAI); TAMS : Notations and Formulae (by Xiangfeng Yang); en ordbok. b. Betygsgränser: 8- poäng ger betyg 3;.5-4.5 poäng ger betyg 4; 5-8 poäng ger betyg 5. (3 poäng) Svensk Version Två stokastiska variabler X och Y har sannolikhetsfunktionen X\Y 0 00 200 00 0.20 0.0 0.20 250 0.05 0.5 0.30 Tabellen berättar att X kan ha värden 00 och 250, och Y kan ha värden 0, 00 och 200. (.). (p) Beräkna P (X = 250) och P (Y = 200). (.2). (p) Beräkna väntevärdet µ = E(Y ) för den stokastiska variabeln Y. (.3). (p) Är X och Y oberoende? Varför? 2 (3 poäng) Antag att mass X av en viss partikel har täthetsfunktionen f(x) = x 0, 0 x 70. c (2.). (p) Beräkna värdet på konstanten c =? (2.2). (p) Beräkna väntevärdet µ = E(X) för X. (2.3). (p) Beräkna den betingade sannolikheten P (X > 30 X < 50). 3 (3 poäng) Det finns 40 elever i en elementär statistik klass. På grundval av många års erfarenhet, vet instruktören att den tid som behövs för att gradera en slumpmässigt vald första tentamen är en stokastisk variabel med ett väntevärd på 6 min och en standardavvikelse på 6 min. Om betygstider är oberoende och instruktören börjar gradering på 6:50 P.M. och graderar kontinuerligt, vad är sannolikheten att han avslutar gradering innan :00 P.M. TV-nyheter börjar? (Ledning: använd centrala gränsvärdessatsen). 4 (3 poäng) Antag att fördelningen för en population X har täthetsfunktionen f(x) = θ +, 0 x θ +, med en okänd parameter θ > 0. Det finns ett stickprov {x,..., x n } från denna population. (4.). (p) Hitta en punktskattning ˆθ MM av θ genom att använda momentmetoden. (4.2). (2p) Hitta en punktskattning ˆθ ML av θ genom att använda Maximum Likelihood-metoden. Page /2

5 (3 poäng) Man har gjort upprepade oberoende mätningar av samma storhet och erhållit följande mätvärden: 5.4, 3.76, 5.09, 5.87, 6.33, 4.03, 6.25, 5.57, 3.28, 5.2, 5.66, 5.0, 4.63, 5.74, 4.20, 4.69. Observationernas medelvärde är x = 5.03, och observationernas standardavvikelse är s = 0.885. Antag att observationerna är oberoende och från en population N(µ, σ 2 ). (5.). (p) Om σ är okänd, finn ett 95% konfidensintervall för µ. (5.2). (p) Om σ = är känd, finn ett 95% konfidensintervall för µ. (5.3). (p) Om σ är okänd, find a 95% confidence interval of σ. 6 (3 poäng) Lång erfarenhet visar att om en viss sjukdom behandlas på traditionellt sätt, så är sannolikheten p att patienten tillfrisknar bara 0.6. I en inledande studie för en ny medicin mot den aktuella sjukdomen har man behandlat 00 patienter och 68 av dem blev friska. (6.). (p) Pröva på nivån α = 0.05 : H 0 : p = 0.6 mot H a : p > 0.6. (6.2). (2p) För testet i (6.), vad är sannolikheten att inte dra slutsatsen att p > 0.6 men p = 0.8? Page 2/2