TryHackMe: Advent Of Cyber 2023 - Day 15

Over the past few weeks, Best Festival Company employees have been receiving an excessive number of spam emails. These emails try to lure users into clicking on links and handing over their credentials. Somehow, the spam is ending up in employees' mailboxes. It looks like the spam detector that had been in place since before the merger was deliberately disabled or damaged. Suspicion falls on McGreedy, who is not happy about the merger.

Problem Statement

McSkidy has been tasked with building a spam email detector using Machine Learning (ML). She has been provided with a sample dataset collected from different sources to train the Machine Learning model.

Questions

What is the key first step in the Machine Learning pipeline?

  • Data Collection

Which data preprocessing feature is used to create new features or modify existing ones to improve model performance?

  • Feature Engineering

During the data splitting step, 20% of the dataset was split for testing. What is the weighted average precision of spam detection?

  • 0.98
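
For anyone wondering where a "weighted avg" precision figure comes from, here is a minimal sketch of the arithmetic in plain Python: precision is computed per class, then averaged with each class weighted by its support (the number of true samples of that class). The labels below are hypothetical and are not the room's dataset, and `weighted_precision` is an illustrative helper, not the function used in the room's notebook.

```python
from collections import Counter


def weighted_precision(y_true, y_pred):
    """Average per-class precision, weighting each class by its support."""
    support = Counter(y_true)  # number of true samples per class
    total = len(y_true)
    score = 0.0
    for label, count in support.items():
        # How many times the model predicted this class.
        predicted = sum(1 for p in y_pred if p == label)
        # How many of those predictions were correct.
        correct = sum(1 for t, p in zip(y_true, y_pred) if t == p == label)
        precision = correct / predicted if predicted else 0.0
        score += precision * count / total
    return score


# Hypothetical labels for ten test emails (assumption, not the room's data).
y_true = ["ham"] * 8 + ["spam"] * 2
y_pred = ["ham"] * 7 + ["spam"] + ["spam", "ham"]

print(round(weighted_precision(y_true, y_pred), 2))  # 0.8
```

Here ham precision is 7/8 and spam precision is 1/2, so the weighted average is 0.875 × 0.8 + 0.5 × 0.2 = 0.8. The room's 0.98 is the same kind of figure, read from the classification report for the 20% test split.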

How many of the test emails are marked as spam?

  • 3
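
Counting the flagged emails comes down to pairing each test email with its predicted label and keeping the spam ones. A rough sketch, assuming the model's predictions are available as a list of labels; the email texts and the names `test_emails` and `predictions` are placeholders, not the room's actual data or variables.

```python
# Placeholder test emails and predicted labels (assumptions for illustration).
test_emails = [
    "Meeting moved to 3pm",
    "Claim your prize now",
    "Lunch on Friday?",
    "Verify your account here",
    "Quarterly report attached",
]
predictions = ["ham", "spam", "ham", "spam", "ham"]

# Keep only the emails whose predicted label is "spam".
spam_emails = [
    email for email, label in zip(test_emails, predictions) if label == "spam"
]

print(len(spam_emails))  # 2
for email in spam_emails:
    print(email)
```

Printing the flagged emails, rather than only counting them, is also how you would read through their contents to find the secret code asked about in the next question.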

One of the emails that is detected as spam contains a secret code. What is the code?

  • I_HaTe_BesT_FestiVal

If you enjoyed this room, please check out the Phishing module.

  • No answer needed

This post is licensed under CC BY 4.0 by the author.