Business, 30.03.2020 21:17 aguilarjose

A university is applying classification methods in order to identify alumni who may be interested in donating money. The university has a database of 58,205 alumni profiles containing numerous variables. Of these 58,205 alumni, only 576 have donated in the past. The university has oversampled the data and trained a random forest of 100 classification trees. For a cutoff value of 0.5, the following confusion matrix summarizes the performance of the random forest on a validation set:

                       Predicted
Actual           Donation    No Donation
Donation              268             20
No Donation          5375          23439

The following table lists some information on individual observations from the validation set:

Observation ID   Actual Class   Probability of Donation   Predicted Class
A                Donation       0.8                       Donation
B                No Donation    0.1                       No Donation
C                No Donation    0.6                       Donation

a. Explain how the probability of Donation was computed for the three observations. Why were observations A and C classified as Donation, while observation B was classified as No Donation?
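
A minimal sketch of one common reading of part a, assuming the forest reports the share of its 100 trees that vote Donation and then applies the 0.5 cutoff. The question does not say how the software computes the probability; some libraries, such as scikit-learn, average each tree's class probabilities instead of counting votes. The vote counts below are hypothetical values chosen to match the tabled probabilities, and treating a probability exactly at the cutoff as Donation is also an assumption.

```python
N_TREES = 100   # size of the forest, from the question
CUTOFF = 0.5    # cutoff value, from the question


def forest_probability(votes_for_donation, n_trees=N_TREES):
    """Share of trees voting Donation (assumed majority-vote scheme)."""
    return votes_for_donation / n_trees


def classify(probability, cutoff=CUTOFF):
    """Label as Donation when the probability reaches the cutoff."""
    return "Donation" if probability >= cutoff else "No Donation"


# Hypothetical vote counts consistent with probabilities 0.8, 0.1 and 0.6.
votes = {"A": 80, "B": 10, "C": 60}

results = {obs: classify(forest_probability(v)) for obs, v in votes.items()}

for obs, label in results.items():
    print(obs, forest_probability(votes[obs]), label)
```

Under this reading, A (0.8) and C (0.6) clear the 0.5 cutoff and are labeled Donation, while B (0.1) falls below it. C is a false positive, since its actual class is No Donation.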

b. Compute the values of accuracy, true positive rate (TPR), false positive rate (FPR), and precision. Evaluate the performance of the classifier; in particular, comment on the precision measure, taking into account the natural occurrence of donors in the data set.
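
The four measures in part b can be worked directly from the confusion matrix. This is a sketch that assumes rows are actual classes, columns are predicted classes, and Donation is the positive class. The variable names are illustrative, and the base rate uses the 576 donors out of 58,205 alumni stated in the question.

```python
# Counts from the validation confusion matrix
# (assumed layout: rows = actual, columns = predicted).
tp, fn = 268, 20        # actual Donation: predicted Donation / No Donation
fp, tn = 5375, 23439    # actual No Donation: predicted Donation / No Donation

total = tp + fn + fp + tn

accuracy = (tp + tn) / total      # share of all predictions that are correct
tpr = tp / (tp + fn)              # true positive rate (sensitivity, recall)
fpr = fp / (fp + tn)              # false positive rate
precision = tp / (tp + fp)        # share of predicted donors who are donors

# Natural occurrence of donors in the full database.
base_rate = 576 / 58205
lift = precision / base_rate      # precision relative to random targeting

print(f"accuracy  = {accuracy:.4f}")
print(f"TPR       = {tpr:.4f}")
print(f"FPR       = {fpr:.4f}")
print(f"precision = {precision:.4f}")
print(f"base rate = {base_rate:.4f}")
print(f"lift      = {lift:.2f}")
```

With these counts, accuracy is about 0.81, TPR about 0.93, FPR about 0.19, and precision about 0.047. The precision looks poor in isolation, but donors make up only about 1% of alumni, so roughly 4.7% donors among those flagged is several times better than contacting alumni at random.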

