subject
Mathematics, 28.03.2020 06:10 kyliegriffis

Computer software is commonly used to translate text from one language to another. As part of his Ph. D. thesis, Philipp Koehn developed a phrase-based translation program called Pharaoh. The quality of the translation can vary. A good translation system should match a professional human translation. It is important to be able to quantify how good the translations produced by Pharaoh are. The IBM T. J. Watson Research Center developed methods to measure the quality of a translation from one language to another. One of these is the BiLingual Evaluation Understudy (BLEU). BLEU is a score ranging from 0 to 1 that indicates how well a computer translation matches a professional human translation of the same text. Higher scores indicate a better match. BLEU helps companies who develop translation software "to monitor the effect of daily changes to their systems in order to weed out bad ideas from good ideas." To compare Pharaoh's ability to translate with similar computer translation software, Koehn took a random sample of 100 blocks of Spanish text, each of which contained 300 sentences, and used Pharaoh to translate each of these to English. The BLEU score was calculated for each of the 100 blocks. He wants to use this data to see if it differs from the mean BLEU score of another leading translation software which has a population mean score of 0.295. Open the data file BLEU-Scores.

1. . Assuming the requirements are satisfied, calculate a 95% confidence interval for the mean of the BLEU test scores.

2. Calculate the degrees of freedom and the test statistic for a test of H0:μ=0.295H0:μ=0.295 against Ha:μ≠0.295Ha:μ≠0.295. Assume the requirements are satisfied.

3. Calculate the `P`-value for a test of H0:μ=0.295H0:μ=0.295 against Ha:μ≠0.295Ha:μ≠0.295. Assume the requirements are satisfied.

4. Based on the results of this test, what would you conclude? Use a level of significance of α=0.05α=0.05

A. We have sufficient evidence to say that the true mean is equal to 0.295.
B. We have insufficient evidence to say that the true mean is equal to 0.295.
C. We have insufficient evidence to say that the true mean is different than 0.295.
D. We have sufficient evidence to say that the true mean is different than 0.295.

ansver
Answers: 1

Other questions on the subject: Mathematics

image
Mathematics, 21.06.2019 17:00, MrKrinkle77
Igor stravinsky tires sells approximately 3,760,000 car tires and 1,200,000 truck tires each year. about 47,000,000 care tires and 26,000,00 truck tires are sold each year in the united states. what is stravinsky's market share in each of these two markets (cars and trucks)?
Answers: 1
image
Mathematics, 21.06.2019 20:00, desereemariahha
Someone answer asap for ! the following statements are true about the coins calvin and sasha have collected. * calvin and sasha has the same amount of money. * calvin has only quarters. * sasha has dimes, nickels, and pennies * calvin has the same number of quarters as sasha has dimes. * sasha has $1.95 in coins that are not dimes. exactly how many quarters does calvin have?
Answers: 3
image
Mathematics, 21.06.2019 20:30, BAJRY
Lola says these two expressions have the same value. expression a expression b which explains whether lola is correct?
Answers: 2
image
Mathematics, 21.06.2019 21:50, roxanneee2145
5. which description does not guarantee that a quadrilateral is a squar ajo is a parallelogram with perpendicular diagonals 0% has all sides congruent and all angles congruent o has all right angles and has all sides congruent 10% is both a rectangle and a rhombus 30%
Answers: 2
You know the right answer?
Computer software is commonly used to translate text from one language to another. As part of his Ph...

Questions in other subjects:

Konu
Biology, 11.07.2019 05:30
Konu
Mathematics, 11.07.2019 05:30