Question: Your data science team wants to use machine learning to better filter out spam messages. The team has gathered a database of 100,000 messages that have been identified as spam or not spam. If you are using supervised machine learning, what would you call this data set?

  1. machine learning algorithm
  2. training set
  3. big data test set
  4. data cluster

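Answer: The correct answer is Option 2: training set. In supervised machine learning, the labeled examples used to fit the model (here, messages already marked as spam or not spam) are called the training set.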
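
To make the idea concrete, here is a minimal sketch of how a labeled training set feeds a supervised learner. The four messages, the `train` and `predict` helpers, and the word-count scoring are all assumptions made up for illustration; they stand in for the team's 100,000-message database and a real spam filter, which would use a proper classifier.

```python
from collections import Counter

# Hypothetical labeled messages standing in for the real database.
# Each example pairs an input (the text) with a known label.
training_set = [
    ("win a free prize now", "spam"),
    ("free money click now", "spam"),
    ("meeting moved to monday", "not spam"),
    ("lunch with the team tomorrow", "not spam"),
]


def train(examples):
    """Count how often each word appears under each label."""
    counts = {"spam": Counter(), "not spam": Counter()}
    for text, label in examples:
        counts[label].update(text.split())
    return counts


def predict(counts, text):
    """Pick the label whose training words overlap the message most."""
    scores = {
        label: sum(word_counts[word] for word in text.split())
        for label, word_counts in counts.items()
    }
    return max(scores, key=scores.get)


model = train(training_set)
print(predict(model, "free prize inside"))
print(predict(model, "team meeting monday"))
```

The labels are what make this supervised: the model only learns which words suggest spam because each training message already carries the right answer.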