subject

Z is a summer intern working on spam classification in your company. The dataset consists of 10 million non-spam emails (class 0) and 10 thousand spam emails (class 1). Z considers the following steps of conducting experiments: Step 1: Shuffle the dataset and split it into the train, validation, and test sets.
Step 2: Train logistic regression models on the train set with different hyper-parameters.
Step 3: Identify the best hyper-parameter using the validation set and report the results on the test set in accuracy.

Do you agree with the above experimental setup? If No, what is the major issue? Provide your suggestions in one or two sentences.

ansver
Answers: 1

Other questions on the subject: Computers and Technology

image
Computers and Technology, 24.06.2019 01:30, jadaroyval
Write a program that asks the user to enter the name of an input file. if the file does not exist, the program should prompt the user to enter the file name again. if the user types quit in any uppercase/lowercase combinations, then the program should exit without any further output.
Answers: 3
image
Computers and Technology, 25.06.2019 00:00, ddatsman1730
One difference of input method between most desktop computers and most tablets is the memory the touch screen the speech recognition
Answers: 1
image
Computers and Technology, 25.06.2019 02:00, whocares1819
Lisa is making a presentation for her project and she wants to use the logo of her company in her presentation. which element would she use to make her logo stand out in her presentation? a. gradation b. contrast c. alignment d. balance
Answers: 1
image
Computers and Technology, 25.06.2019 14:00, TayJoker966
Ken paid $12 for two magazines. the cost of each magazine was a multiple of $3. what are the possible prices of the magazines?
Answers: 1
You know the right answer?
Z is a summer intern working on spam classification in your company. The dataset consists of 10 mill...

Questions in other subjects: