subject

Please complete the following instructions: 1. Download the Chicago_Crimes_Assign_4500Bal. csv download dataset.

2. Create an Orange workflow that will do the following:

Ingest the Chicago_Crimes_Assign_4500Bal. csv download dataset.

Preprocess the data to retrieve the most relevant 2 features.

Continuize the discrete categorical variables as numerical. See the video for which option to select here.

Create a k-Means module and let the number of clusters be chosen by the Silhouette score.

Create a Silhoutte Plot

Create a Scatter Plot and compare the two features. Color by cluster.

Answer the following questions in a Word document:

How many clusters was your final result produced in? What was the Silhouette score of the most optimal cluster sizes?

What were the 2 features chosen by your preprocessing?

What can you say about the scatter plot produced? Think about how the categorical variables are transformed into numbers. You don't need to know what the values are that are encoded to make observations about the relationships between the variables.

Try to switch to a using few different number of clusters? Look at your scatterplot. Does this make more sense or less?

Using the Silhouette Plot at the same time as the Scatter Plot, how many of each cluster are ranked in the bottom of cohesion? You can highlight them to see them on the scatter plot (see video). What can you say about these data points in each cluster?

3. Open your workflow from the Chicago Crimes Classification Assignment. You can choose the Undersampled or Oversampled version (note the Oversampled version will take longer for the Neural Network to train)

Add the Neural Network model widget.

Configure the Neural Network as follows:

Give it 2 hidden layers of 50 neurons in each layer

Make the Activation function: ReLu

Make the Solver: Adam

Regularization: leave as is at 0.0001

Maximal number of iterations: 100

Ensure replicable training is checked

Connect the Neural Network widget to the training data as an input and the Test & Score as output (see video)

Connect the Neural Network output to the Predict widget.

Compare the new results in Test & Score, Confusion Matrix and ROC Score.

In the Word Document you created for clustering, answer these questions:

List the results in the Word Document.

Did the Neural Network model perform better or worse than the other models?

Why do you think it performed better or worse?

Send me your email for the link!

ansver
Answers: 2

Other questions on the subject: Computers and Technology

image
Computers and Technology, 22.06.2019 14:20, capo9972
Cengagenowv2 is a comprehensive online learning tool. using cengagenowv2, you may access all of the following except: 2. each time you log in, cengagenowv2 automatically performs a system check and informs you if your computer does not meet the cengagenowv2 system requirements. 3. which tab/page allows you to easily track your assignment scores, number of submissions, time spent, as well as the ability view assign
Answers: 3
image
Computers and Technology, 23.06.2019 09:10, djs1671
(328 inc. 448 ind. 480 in25. john has a collection of toy cars. he has 2 red cars, 4 blue cars, 4 black cars, and 6 yellowcars. what is the ratio of red cars to yellow cars? a. 1: 2b. 1: 3c. 1: 626. the net of a right triangular prism is shown below.
Answers: 2
image
Computers and Technology, 23.06.2019 14:30, qveenvslayin
The basic work area of the computer is it screen that you when you first fire up your computer
Answers: 1
image
Computers and Technology, 24.06.2019 14:00, maddi0132
In the microsoft® access® and microsoft excel® programs, the ribbon contains tabs that are divided into with like tools in them. parts groups containers bunches
Answers: 1
You know the right answer?
Please complete the following instructions: 1. Download the Chicago_Crimes_Assign_4500Bal. csv down...

Questions in other subjects:

Konu
Mathematics, 25.08.2020 01:01
Konu
Biology, 25.08.2020 01:01