TryHackMe: Advent Of Cyber 2023 - Day 15
Over the past few weeks, Best Festival Company employees have been receiving an excessive number of spam emails. These emails are trying to lure users into the trap of clicking on links and providing credentials. Spam emails are somehow ending up in the mailing box. It looks like the spam detector in place since before the merger has been disabled/damaged deliberately. Suspicion is on McGreedy, who is not so happy with the merger.
Problem Statement
McSkidy has been tasked with building a spam email detector using Machine Learning (ML). She has been provided with a sample dataset collected from different sources to train the Machine Learning model.
Questions
What is the key first step in the Machine Learning pipeline?
- Data Collection
Which data preprocessing feature is used to create new features or modify existing ones to improve model performance?
- Feature Engineering
During the data splitting step, 20% of the dataset was split for testing. What is the percentage weightage avg of precision of spam detection?
- 0.98
How many of the test emails are marked as spam?
- 3
One of the emails that is detected as spam contains a secret code. What is the code?
- I_HaTe_BesT_FestiVal
If you enjoyed this room, please check out the Phishing module.
- No answer needed