Compare Proposal

Nothing to compare.

Need Processing IRC LOGS

  • Posted at : 4 months ago
  • Post Similar Project
1000

Budget
4
Proposals
128
Views
Open
Status

Posted By -

IP

0.0
Projects Posted : 6
Projects Paid : 0
Services Purchased : 0
Total Spent :
0
Feedbacks : 0 %

Project Details show (+) hide (-)

You need to analyze textual log data from an online chat forum related to the
Anonymous hacktivist group. You will learn how to apply regular expressions, summarize log data,
quantify text data, and summarize time trends.
DATA
IRC is an early protocol for instant messaging developed in the early years of the Internet. The
openness and ability to remain anonymous has made IRC a popular channel for hacker networks to
collaborate and share ideas.
The data comes from https://www.azsecure-data.org/internet-relay-chat.html. It contains two years
of chats between hackers associated with the hacktivist group Anonymous. In these logs they share
information about malware, setting up servers to deploy attacks, and other information related to
hacking systems.
The collection and analysis of these chats is a form of cyber-threat intelligence. The analysis of these
chats and other dark web data sources enable proactive defense against attacks.
ANALYSIS
1. Many users log in and view the chat without commenting. Which users spent the most time
in the logs? (3pts) Which users logged in the most (2pts)
2. Find the most common words (3 pts)
3. Count the total number of written messages (only those with actual text content) (2 pts).
Summarize the users that posted the most messages (2pts)
4. Find and rank (by count) words not in an English dictionary (3 pts). This is a simple method
that can identify some names of malware tools
5. Which hours of the day had the most messages (2pts)? Which days had the most traffic (or
messages) (2pts)?
6. Find and list the URLs posted in the chat. (2pts)

Your Job Feed