-
Notifications
You must be signed in to change notification settings - Fork 16
/
sample_misp_playbook_to_cacao_malware_triage.json
518 lines (518 loc) · 165 KB
/
sample_misp_playbook_to_cacao_malware_triage.json
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
{
"type": "playbook",
"spec_version": "cacao-2.0",
"id": "playbook--3bc94d2b-598a-4c33-8f7d-44d07ac7652d",
"created": "2024-10-16T07:41:53Z",
"modified": "2024-10-16T07:41:53Z",
"created_by": "identity--ee23aaac-78d1-4b51-a88c-74dde045af42",
"name": "Malware triage with MISP",
"description": "Malware triage with MISP",
"labels": [
"malware",
"triage",
"incidentresponse",
"ir",
"dfir"
],
"playbook_types": [
"investigation"
],
"playbook_activities": [
"PR:1 Initialise environment",
"PR:2 Verify MISP modules",
"PR:3 Load helper functions",
"PR:4 Set helper variables",
"PR:5 Load and defang samples",
"PR:6 MISP event details",
"PR:7 Setup MISP event link",
"IN:1 File analysis",
"IN:2 VirusTotal results",
"IN:3 Hashlookup",
"IN:4 MalwareBazaar",
"IN:5 PE imports and exports",
"IN:6 MISP report for investigation",
"CR:1 Correlation with MISP events",
"CR:2 Correlation with MISP feeds",
"SA:1 Store in MWDB",
"EN:1 MISP indicators",
"EN:2 Create the summary of the playbook",
"EN:3 Print the input for malware triage",
"EN:4 Send a summary to Mattermost",
"EN:5 End of the playbook"
],
"workflow_start": "start--75f753e3-be32-4156-8b1f-188b3bac596a",
"workflow": {
"start--75f753e3-be32-4156-8b1f-188b3bac596a": {
"type": "start",
"name": "Start of Playbook",
"on_completion": "action--780722a7-1a73-40eb-bd71-b4289b818ea0"
},
"action--780722a7-1a73-40eb-bd71-b4289b818ea0": {
"type": "action",
"name": "Introduction",
"description": "# Malware triage with MISP\n\n## Introduction\n\n- UUID: **68b42f4d-8d5e-46c4-97f8-b87b9df210a3**\n- Started from [issue 2](https://github.com/MISP/misp-playbooks/issues/2)\n- State: **Published**\n- Purpose: A playbook to provide an analyst sufficient information to do basic malware triage on one or more samples.\n - This playbook creates a **MISP event** for malware triage. \n - Samples are **attached** to a MISP event (with file object relations).\n - VirusTotal and MalwareBazaar are used to get the **detection rate**, **threat classification** and **sandbox** information.\n - Hashlookup is used to check for **known hashes**.\n - PEfile analysis is done for **imports** and **exports**.\n - The results are stored in **MISP reports** and as MISP objects where relevant.\n - Correlations with MISP events or data feeds are added to a summary.\n - The sample is shared with a local instance of **MWDBcore**.\n - The summary is then sent to Mattermost.\n- Tags: [ \"malware\", \"triage\", \"incidentresponse\", \"ir\", \"dfir\" ]\n- External resources: **VirusTotal, Hashlookup, MalwareBazaar, MWDB, Mattermost**\n- Target audience: **SOC**, **CSIRT**, **CTI**\n- Notes:\n - Samples are stored in a \"malwarezoo\", defined in the variable `malwarezoo`.\n - All samples that do not have an extension \"do-not-run\" (set in `defangsuffix`) are considered as unprocessed.\n - The playbook adds the defangsuffix when it processes the samples.\n - For VirusTotal and MalwareBazaar, both the MISP modules as well as direct queries to VT/MB are used.",
"commands": [
{
"type": "manual",
"command": ""
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--059f72f4-25e6-40fb-b80d-b8c84ed26437"
},
"action--059f72f4-25e6-40fb-b80d-b8c84ed26437": {
"type": "action",
"name": "PR:1 Initialise environment",
"description": "# Preparation\n\n## PR:1 Initialise environment\n\nThis section **initialises the playbook environment** and loads the required Python libraries. \n\nThe credentials for MISP (**API key**) and other services are loaded from the file `keys.py` in the directory **vault**. A [PyMISP](https://github.com/MISP/PyMISP) object is created to interact with MISP and the active MISP server is displayed. By printing out the server name you know that it's possible to connect to MISP. In case of a problem PyMISP will indicate the error with `PyMISPError: Unable to connect to MISP`.\n\nThe contents of the `keys.py` file should contain at least :\n\n```\nmisp_url=\"<MISP URL>\" # The URL to our MISP server\nmisp_key=\"<MISP API KEY>\" # The MISP API key\nmisp_verifycert=<True or False> # Indicate if PyMISP should attempt to verify the certificate or ignore errors\n```",
"commands": [
{
"type": "manual",
"command": "# Initialise Python environment\nimport urllib3\nimport sys\nimport json\nfrom pyfaup.faup import Faup\nfrom prettytable import PrettyTable, MARKDOWN\nfrom IPython.display import Image, display, display_markdown, HTML\nfrom datetime import date\nimport requests\nimport uuid\n#from uuid import uuid4\nfrom pymisp import *\nfrom pymisp.tools import GenericObjectGenerator\nfrom pymisp.tools import make_binary_objects\nimport logging\n\nimport os\nimport time\nimport pefile\nfrom datetime import datetime\nfrom mwdblib import MWDB\n\n# Load the credentials\nsys.path.insert(0, \"../vault/\")\nfrom keys import *\nif misp_verifycert is False:\n import urllib3\n urllib3.disable_warnings(urllib3.exceptions.InsecureRequestWarning)\nprint(\"The \\033[92mPython libraries\\033[90m are loaded and the \\033[92mcredentials\\033[90m are read from the keys file.\")\n\n# Create the PyMISP object\nmisp = PyMISP(misp_url, misp_key, misp_verifycert)\nprint(\"I will use the MISP server \\033[92m{}\\033[90m for this playbook.\\n\\n\".format(misp_url))\n\n# Create headers for specific cases\nmisp_headers = {\"Authorization\": misp_key, \"Content-Type\": \"application/json\", \"Accept\": \"application/json\"}"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--c0ff32ee-cf58-4fb3-948d-40670cd0f8e1"
},
"action--c0ff32ee-cf58-4fb3-948d-40670cd0f8e1": {
"type": "action",
"name": "PR:2 Verify MISP modules",
"description": "## PR:2 Verify MISP modules\n\nThis playbook uses the MISP modules to obtain additional correlation or enrichment information. [MISP modules](https://github.com/MISP/misp-modules) are autonomous modules that can be used to extend MISP for new services such as expansion, import and export. The modules are written in Python 3 following a simple API interface. The objective is to ease the extensions of MISP functionalities without modifying core components. The API is available via a simple REST API which is independent from MISP installation or configuration.\n\nIn the next cell we check if we have access to the **MISP module** server and if the required modules are enabled.",
"commands": [
{
"type": "manual",
"command": "# Where can we find the local MISP Module server? You can leave this to the default setting in most cases.\nmisp_modules_url = \"http://127.0.0.1:6666\"\n\n# How long do we wait between queries when using the MISP modules (API rate limiting of external service such as VirusTotal)\nmisp_modules_wait = 3\n\n# Initiliasation\nmisp_modules = {}\nmisp_modules_headers = {\n \"Content-Type\": \"application/json\",\n \"Accept\": \"application/json\"\n}\nmisp_modules_in_use = [\"hashlookup\", \"virustotal_public\", \"malwarebazaar\"]\n# Code block to query the MISP module server and check if our modules are enabled\nres = requests.get(\"{}/modules\".format(misp_modules_url), headers=misp_modules_headers)\nfor module in res.json():\n for module_requested in misp_modules_in_use:\n if module.get(\"name\", False) == module_requested:\n misp_modules[module_requested] = {\"enabled\": True, \"input\": module.get(\"mispattributes\").get(\"input\")}\n print(\"Found the \\033[92m{}\\033[90m MISP module (Accepted input: {}).\".format(module_requested, misp_modules[module_requested][\"input\"]))\nprint(\"\\n\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--20de29d0-f151-4a10-9830-f878cf4defc6"
},
"action--20de29d0-f151-4a10-9830-f878cf4defc6": {
"type": "action",
"name": "PR:3 Load helper functions",
"description": "## PR:3 Load helper functions\n\nThe next cell contains **helper functions** that are used in this playbook. \n\nInstead of distributing helper functions as separate Python files this playbook includes all the required code as one code cell. This makes portability of playbooks between instances easier. The downside is that functions defined in this playbook need to be defined again in other playbooks, which is not optimal for code re-use. For this iteration of playbooks it is chosen to include the code in the playbook (more portability), but you can easily create one \"helper\" file that contains all the helper code and then import that file in each playbook (for example by adding to the previous cell `from helpers import *`). Note that the graphical workflow image is included as an external image. A missing image would not influence the further progress of the playbook.",
"commands": [
{
"type": "manual",
"command": "def pb_get_misp_tags(tags=[], local_tags=[]):\n '''\n Get a list of MISP tags based on a Python list\n\n :param misp: MISP object\n :param object_template: which object template to return\n '''\n misp_tags = []\n for el in tags:\n t = MISPTag()\n t.name = el\n t.local = False\n misp_tags.append(t)\n\n for el in local_tags:\n t = MISPTag()\n t.name = el\n t.local = True\n misp_tags.append(t)\n return misp_tags\n\n\ndef calculate_risk_score(ratio):\n '''\n Calculate risk score and return the score and a tag\n '''\n detection_ratio_perc = 0\n score = 0\n base = 0\n tag = \"\"\n if len(ratio) > 0:\n for detection_ratio in ratio:\n score += int(detection_ratio.split(\"/\")[0])\n base += int(detection_ratio.split(\"/\")[1])\n if base > 0:\n detection_ratio_perc = round((score / base) * 100, 2)\n if detection_ratio_perc > 0:\n if detection_ratio_perc >= 75:\n tag = \"misp:threat-level=\\\"high-risk\\\"\"\n elif detection_ratio_perc >= 50:\n tag = \"misp:threat-level=\\\"medium-risk\\\"\"\n #elif detection_ratio_perc >= 25:\n else:\n tag = \"misp:threat-level=\\\"low-risk\\\"\"\n else:\n tag = \"misp:threat-level=\\\"no-risk\\\"\"\n return detection_ratio_perc, tag"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--f5e57d58-0eff-448b-94fb-b00371467294"
},
"action--f5e57d58-0eff-448b-94fb-b00371467294": {
"type": "action",
"name": "PR:4 Set helper variables",
"description": "## PR:4 Set helper variables\n\nThis cell contains **helper variables** that are used in this playbook. Their usage is explained in the next steps of the playbook.\n\n- `samples` : a dictionary of the objects that are created when the playbook progresses\n- `malwarezoo` : the location where the samples are uploaded\n- `defangsuffix` : the suffix to add when defanging the samples",
"commands": [
{
"type": "manual",
"command": "# Dictionary to store playbook results\nsamples = {}\n\n# Location to upload samples\nmalwarezoo = \"/data/notebook-demo/playbook/notebooks/evidence\"\n\n# Suffix to add when defanging the samples\ndefangsuffix = \".do-not-run\"\n\n# Helper variables\nmisp_event = False\nsummary = \"\""
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--4427808c-ac8f-466d-827a-7650479675b2"
},
"action--4427808c-ac8f-466d-827a-7650479675b2": {
"type": "action",
"name": "PR:5 Load and defang samples",
"description": "## PR:5 Load and defang samples\n\nThis section checks if there are unprocessed samples in `malwarezoo` and **loads these samples** in the dictionary `samples`. It will also defang the sample to prevent accidential execution. A sample with the `defangsuffix` suffix is considered as processed.",
"commands": [
{
"type": "manual",
"command": "# Check if the malware samples directory exists and misp modules are enabled\nmalwarezoo_errors = False\n\nmalwarezoo_samples = [file for file in os.listdir(malwarezoo) if os.path.isfile(os.path.join(malwarezoo, file))]\nif len(malwarezoo_samples) == 0:\n print(\"There are no files in \\033[91m{}\\033[90m.\".format(malwarezoo))\n print(\"\\033[91mUnable to proceed!\\033[90m\")\nelse:\n # Defang the samples before we continue\n print(\"Defang samples\")\n for sample in malwarezoo_samples:\n if defangsuffix not in sample:\n sample_path = \"{}/{}{}\".format(malwarezoo, sample, defangsuffix)\n sample_orig_path = \"{}/{}\".format(malwarezoo, sample)\n os.system(\"mv {} {}\".format(sample_orig_path, sample_path))\n samples[sample] = {\"path\": sample_path, \"orig_path\": sample_orig_path}\n samples[sample][\"MISP\"] = []\n samples[sample][\"Feeds\"] = []\n samples[sample][\"detection_ratio\"] = []\n samples[sample][\"detection_ratio_indirect\"] = []\n samples[sample][\"vt\"] = {}\n samples[sample][\"malwarebazaar\"] = {}\n samples[sample][\"hashlookup\"] = {}\n samples[sample][\"pe\"] = {}\n print(\" Renamed {} to \\033[92m{}\\033[90m\".format(sample, sample_path))\n else:\n print(\" Skip \\033[91m{}\\033[90m, already processed (defanged).\".format(sample))\n if len(samples) > 0:\n print(\"\\n\\033[92mContinue\\033[90m the playbook with \\033[92m{}\\033[90m samples.\\n\".format(len(samples)))\n else:\n print(\"All samples in the malwarezoo ({}) are \\033[91malready processed\\033[90m (defanged).\\n\".format(malwarezoo))"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--c06ffc4b-1955-4166-a665-a8170708b2f9"
},
"action--c06ffc4b-1955-4166-a665-a8170708b2f9": {
"type": "action",
"name": "PR:6 MISP event details",
"description": "## PR:6 MISP event details\n\n### Event title\n\nIn this playbook we create a **new** MISP event with title **Malware triage for _malwarelist_**. You get the chance to override this default title but remember that it is good practice to choose a self-explanatory **event title**. This event title is shown in the MISP event index and should provide you the necessary information what the event is about. You should avoid using generic event titles. Read the [Best Practices in Threat Intelligence](https://www.misp-project.org/best-practices-in-threat-intelligence.html) for further guidance.\n\n### Contexualisation\n\nThis playbook adds event contexualisation via the **tags** that are defined in `event_additional_global_tags` (for *global* tags) and `event_additional_local_tags` (for *local* tags). As a reminder, whereas *global* tags remain attached to the events that you share with your community, the *local* tags are not shared outside your organisation. It's also a good idea to primarily use tags that are part of a [taxonomy](https://github.com/MISP/misp-taxonomies), this allows you to make the contexualisation more portable accross multiple MISP instances.\n\nIn this playbook the list of tags is build via one of the helper functions `pb_get_misp_tags`. This function takes two arguments, first a list of tags to convert as *global* tags, and secondly a list of tags to convert as *local* tags. It then returns a Python list of MISPTag objects.\n\n### Traffic Light Protocol\n\nThe default **TLP** for this event is **<span style='color:#FFBF00'>tlp:amber</span>**. The Traffic Light Protocol (TLP) facilitates sharing of potentially sensitive information and allows for more effective collaboration. TLP is a set of four standard labels to indicate the sharing boundaries to be applied by the recipients. TLP is always set by the creator of information. You can find more information at [FIRST](https://www.first.org/tlp/). You can specify the TLP via `event_tlp`.\n\n### MISP galaxies\n\nThis playbook can also add MISP galaxies to the event with the variable `event_galaxies`. You can also leave the list empty if you do not want to add galaxies in this stage of the investigation.\n\n### MISP distribution, threat level and analysis level\n\nOptionally you can specifiy a MISP **distribution** (with `event_distribution`), **threat level** (with `event_threat_level_id`) or **analysis state** (with `event_analysis`). The event **date** is set to today via `event_date`.\n\nIf you cannot remember the options for distribution, threat level or the analysis state then use the next cell to guide you. This cell is set as **raw**. If you **change its type to code** and execute the cell you get an overview of the options available for creating a MISP event.",
"commands": [
{
"type": "manual",
"command": ""
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--b474b97f-4cd7-4d1f-b682-f67ee4252eaf"
},
"action--b474b97f-4cd7-4d1f-b682-f67ee4252eaf": {
"type": "action",
"name": "PR:7 Setup MISP event link",
"description": "## PR:7 Setup MISP event link\n\nBy default the playbook will generate a **title** with a prefix and the malware you want to investigate. You can override this event title with the variable `event_title`. If you leave this value empty the playbook will generate the MISP event title for you.",
"commands": [
{
"type": "manual",
"command": "# Provide the event title for a new event. Leave blank for the playbook to auto generate one\nevent_title = \"\"\n\n# Prefix for auto generate event title\nevent_title_default_prefix = \"Malware triage\"\n\n# Optionally, you can change TLP, add additional event (local and global) tags, threatlevel, analysis state or distribution level\nevent_tlp = \"tlp:amber\"\n\n# Event context\nevent_additional_global_tags = [] # This needs to be a Python list\nevent_additional_local_tags = [\"workflow:state=\\\"incomplete\\\"\"] # This needs to be a Python list\n\n# Event galaxies\nevent_galaxies = [ \"misp-galaxy:mitre-attack-pattern=\\\"Malware - T1587.001\\\"\", \"misp-galaxy:mitre-attack-pattern=\\\"Malware - T1588.001\\\"\"]\n\n# Additional MISP event settings\nevent_threat_level_id = ThreatLevel.low\nevent_analysis = Analysis.ongoing\nevent_distribution = Distribution.your_organisation_only\nevent_date = date.today()"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--b2bb108f-1dba-49e1-be21-e46cc4592e5a"
},
"action--b2bb108f-1dba-49e1-be21-e46cc4592e5a": {
"type": "action",
"name": "Create MISP event",
"description": "### Create MISP event\n\nThe next code cell will **create the MISP event** and store the references to the newly created event in the variable `misp_event`. This variable is used further when the playbook progresses.",
"commands": [
{
"type": "manual",
"command": "# Code block to create the event or add data to an existing event\nevent_title = event_title.strip()\n\nif not(len(event_title) > 0):\n query_sample = \"\"\n for key, sample in samples.items():\n query_sample = \"{} {}\".format(key, query_sample)\n event_title = \"{} for {}\".format(event_title_default_prefix, query_sample)\n\n# Construct the event tags\nevent_additional_global_tags.append(event_tlp)\nif len(event_galaxies) > 0:\n event_additional_global_tags.append(event_galaxies)\nevent_tags = pb_get_misp_tags(event_additional_global_tags, event_additional_local_tags)\n\n# Create the PyMISP object for an event\nevent = MISPEvent()\nevent.info = event_title\nevent.distribution = event_distribution\nevent.threat_level_id = event_threat_level_id\nevent.analysis = event_analysis\nevent.set_date(event_date)\n\n# Create the MISP event on the server side\nmisp_event = misp.add_event(event, pythonify=True)\nprint(\"Continue the playbook with the new \\033[92mcreated\\033[90m MISP event ID {} with title \\033[92m{}\\033[90m and UUID {}\".format(misp_event.id, misp_event.info, misp_event.uuid))\nfor tag in event_tags:\n if len(tag) > 0:\n misp.tag(misp_event.uuid, tag, local=tag.local)\n print(\"\\033[92mAdded\\033[90m event tag {}\".format(tag))\nprint(\"\\n\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--78f6bddc-589c-4cb9-9f73-826675f614c5"
},
"action--78f6bddc-589c-4cb9-9f73-826675f614c5": {
"type": "action",
"name": "IN:2 VirusTotal results",
"description": "## IN:2 VirusTotal results\n\nNext we use [VirusTotal](https://www.virustotal.com/gui/). First we use the MISP modules to query VirusTotal. The advantage of using the MISP modules is that it returns MISP attributes and objects that can then be easily added to the event.\n\nAfter using the modules, we query VirusTotal directly (without using the module) to get the **threat classification**, **sandbox** results and the **Sigma** rules that triggered detection.\n\nNote that the sample is **not** submitted to VirusTotal for analysis.",
"commands": [
{
"type": "manual",
"command": "# Loop through the samples\n\n# Use the modules to create MISP objects\n# Then do a second query to get the file details\n\nscan_count = 0\nmodule_name = \"virustotal_public\"\nheaders = {\n \"accept\": \"application/json\",\n \"x-apikey\": virustotal_apikey\n}\n\nprint(\"Verifying samples with {}\".format(module_name))\nfor key, sample in samples.items():\n attribute_type = \"sha256\"\n value = sample.get(\"sha256\")\n module_comment = \"From {} for {}\".format(module_name, key)\n data = {\n \"attribute\": {\n \"type\": f\"{attribute_type}\",\n \"uuid\": str(uuid.uuid4()),\n \"value\": f\"{value}\",\n },\n \"module\": module_name,\n \"config\": {\"apikey\": virustotal_apikey}\n }\n print(\"Query \\033[92m{}\\033[90m as \\033[92m{}\\033[90m\".format(value, attribute_type))\n result = requests.post(\"{}/query\".format(misp_modules_url), headers=misp_modules_headers, json=data)\n if \"results\" in result.json() and len(result.json()[\"results\"]) > 0:\n result_json = result.json()[\"results\"]\n\n for misp_attribute in result_json.get(\"Attribute\", []):\n misp_attribute[\"comment\"] = \"{}{}\".format(module_comment, misp_attribute.get(\"comment\", \"\"))\n created_attribute = misp.add_attribute(misp_event.uuid, misp_attribute, pythonify=True)\n if not \"errors\" in created_attribute:\n print(\" Got {} \\033[92m{}\\033[90m\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_attribute.uuid, \"related-to\"))\n else:\n print(\" Unable to add {} \\033[92m{}\\033[90m to MISP event\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n for misp_object in result_json.get(\"Object\", []):\n misp_object[\"comment\"] = \"{}{}\".format(module_comment, misp_object.get(\"comment\", \"\"))\n if len(misp_object[\"Attribute\"]) > 0:\n created_object = misp.add_object(misp_event.uuid, misp_object, pythonify=True)\n if not \"errors\" in created_object:\n print(\" Got \\033[92m{}\\033[90m \".format(misp_object[\"name\"]))\n misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_object.uuid, \"related-to\"))\n if misp_object[\"name\"] == \"virustotal-report\":\n direct_vt_report = False\n detection_ratio = False\n for attribute in misp_object.get(\"Attribute\", []):\n if attribute.get(\"object_relation\") == \"permalink\" and value in attribute.get(\"value\"):\n direct_vt_report = True\n if attribute.get(\"object_relation\") == \"detection-ratio\":\n detection_ratio = attribute.get(\"value\")\n if detection_ratio:\n if direct_vt_report:\n samples[key][\"detection_ratio\"].append(detection_ratio)\n print(\" Got detection ratio \\033[92m{}\\033[90m \".format(detection_ratio))\n else:\n samples[key][\"detection_ratio_indirect\"].append(detection_ratio)\n print(\" Got indirect detection ratio \\033[92m{}\\033[90m \".format(detection_ratio))\n else:\n print(\" Unable to add \\033[92m{}\\033[90m to MISP event\".format(misp_object[\"name\"]))\n\n risk1, tag1 = calculate_risk_score(sample.get(\"detection_ratio\"))\n risk2, tag2 = calculate_risk_score(sample.get(\"detection_ratio_indirect\"))\n if len(tag1) > 0:\n misp.tag(sample.get(\"sha256_uuid\"), tag1)\n misp.tag(sample.get(\"malware-sample_uuid\"), tag1)\n\n risk_rating = \"vt-detection-ratio={}\\nvt-detection-ratio-indirect={}\".format(risk1, risk2)\n samples[key][\"detection_ratio_perc\"] = risk1\n samples[key][\"detection_ratio_indirect_perc\"] = risk2\n samples[key][\"risk_rating\"] = risk_rating\n sample.get(\"fileobject\").add_attribute(\"text\", risk_rating)\n misp.update_object(sample.get(\"fileobject\"))\n\n #################################################################\n\n print(\" Query VirusTotal directly to get file details\")\n vt_url = \"https://www.virustotal.com/api/v3/files/{}\".format(value)\n result = requests.get(vt_url, headers=headers)\n if \"data\" in result.json() and len(result.json()[\"data\"]) > 0:\n result_json = result.json()[\"data\"][\"attributes\"]\n samples[key][\"vt\"][\"data\"] = result_json\n samples[key][\"vt\"][\"popular_threat_classification\"] = result_json.get(\"popular_threat_classification\", [])\n samples[key][\"vt\"][\"sigma_analysis_results\"] = result_json.get(\"sigma_analysis_results\", [])\n samples[key][\"vt\"][\"sandbox_verdicts\"] = result_json.get(\"sandbox_verdicts\", [])\n samples[key][\"vt\"][\"sandboxes\"] = []\n samples[key][\"vt\"][\"markdown\"] = \"\"\n\n if len(samples[key][\"vt\"][\"popular_threat_classification\"]) > 0:\n samples[key][\"vt\"][\"markdown\"] = \"#### Popular threat classifications\\n\"\n for el in samples[key][\"vt\"][\"popular_threat_classification\"]:\n if el == \"suggested_threat_label\":\n samples[key][\"vt\"][\"suggested_threat_label\"] = samples[key][\"vt\"][\"popular_threat_classification\"][el]\n samples[key][\"vt\"][\"markdown\"] = \"{}\\n- {}: {}\".format(samples[key][\"vt\"][\"markdown\"], el, samples[key][\"vt\"][\"popular_threat_classification\"][el])\n\n if len(samples[key][\"vt\"][\"sigma_analysis_results\"]) > 0:\n samples[key][\"vt\"][\"markdown\"] = \"{}\\n\\n#### Sigma analysis results\\n\".format(samples[key][\"vt\"][\"markdown\"])\n for el in samples[key][\"vt\"][\"sigma_analysis_results\"]:\n samples[key][\"vt\"][\"markdown\"] = \"{}\\n - {}\".format(samples[key][\"vt\"][\"markdown\"], el[\"rule_title\"])\n\n if len(samples[key][\"vt\"][\"sandbox_verdicts\"]) > 0:\n samples[key][\"vt\"][\"markdown\"] = \"{}\\n\\n#### Sandbox verdicts\\n\".format(samples[key][\"vt\"][\"markdown\"])\n for el in samples[key][\"vt\"][\"sandbox_verdicts\"]:\n samples[key][\"vt\"][\"sandboxes\"].append(el)\n samples[key][\"vt\"][\"markdown\"] = \"{}\\n - **{}**\".format(samples[key][\"vt\"][\"markdown\"], el)\n for el_sandbox in samples[key][\"vt\"][\"sandbox_verdicts\"][el]:\n samples[key][\"vt\"][\"markdown\"] = \"{}\\n - {}: {}\".format(samples[key][\"vt\"][\"markdown\"], el_sandbox, samples[key][\"vt\"][\"sandbox_verdicts\"][el][el_sandbox])\n print(\" Finished query VirusTotal directly\")\n else:\n print(\"No results for \\033[91m{}\\033[90m.\".format(value))\n scan_count += 1\n if scan_count > 5:\n print(\"Sleeping for {} seconds\".format(misp_modules_wait))\n time.sleep(misp_modules_wait)\n scan_count = 0\nprint(\"Finished {}\".format(module_name))"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--1e3855ff-0256-42e7-9e34-3ec214f01a3e"
},
"action--1e3855ff-0256-42e7-9e34-3ec214f01a3e": {
"type": "action",
"name": "IN:3 Hashlookup",
"description": "## IN:3 Hashlookup\n\nWe use [CIRCL Hashlookup](https://www.circl.lu/services/hashlookup/) to identify if any of the samples correspond with **known hashes**.",
"commands": [
{
"type": "manual",
"command": "# Loop through the samples\n\nmodule_name = \"hashlookup\"\nprint(\"Lookup the file in {}\".format(module_name))\nfor key, sample in samples.items():\n attribute_type = \"sha256\"\n value = sample.get(\"sha256\")\n module_comment = \"From {} for {}\".format(module_name, key)\n data = {\n \"attribute\": {\n \"type\": f\"{attribute_type}\",\n \"uuid\": str(uuid.uuid4()),\n \"value\": f\"{value}\",\n },\n \"module\": module_name,\n \"config\": {\"custom_API\": False}\n }\n print(\"Query \\033[92m{}\\033[90m as \\033[92m{}\\033[90m\".format(value, attribute_type))\n result = requests.post(\"{}/query\".format(misp_modules_url), headers=misp_modules_headers, json=data)\n if \"results\" in result.json() and len(result.json()[\"results\"]) > 0:\n result_json = result.json()[\"results\"]\n hashlookup_hit = False\n hashlookuptext = \"\"\n for misp_attribute in result_json.get(\"Attribute\", []):\n misp_attribute[\"comment\"] = \"{}{}\".format(module_comment, misp_attribute.get(\"comment\", \"\"))\n created_attribute = misp.add_attribute(misp_event.uuid, misp_attribute, pythonify=True)\n if not \"errors\" in created_attribute:\n print(\" Got {} \\033[92m{}\\033[90m\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_attribute.uuid, \"related-to\"))\n else:\n print(\" Unable to add {} \\033[92m{}\\033[90m to MISP event\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n for misp_object in result_json.get(\"Object\", []):\n misp_object[\"comment\"] = \"{}{}\".format(module_comment, misp_object.get(\"comment\", \"\"))\n if len(misp_object[\"Attribute\"]) > 0:\n created_object = misp.add_object(misp_event.uuid, misp_object, pythonify=True)\n if not \"errors\" in created_object:\n\n print(\" Got \\033[92m{}\\033[90m \".format(misp_object[\"name\"]))\n misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_object.uuid, \"related-to\"))\n for attribute in misp_object.get(\"Attribute\", []):\n if attribute[\"object_relation\"] not in [\"MD5\", \"SHA-1\", \"SHA-256\", \"SSDEEP\"]:\n hashlookuptext = \"{}: {}\\n{}\".format(attribute[\"object_relation\"], attribute[\"value\"], hashlookuptext)\n if attribute.get(\"object_relation\") == \"KnownMalicious\":\n hashlookup_hit = True\n else:\n print(\" Unable to add \\033[92m{}\\033[90m to MISP event\".format(misp_object[\"name\"]))\n\n if hashlookup_hit:\n misp.tag(sample.get(\"sha256_uuid\"), \"misp-workflow:analysis=\\\"known-file-hash\\\"\")\n misp.tag(sample.get(\"malware-sample_uuid\"), \"misp-workflow:analysis=\\\"known-file-hash\\\"\")\n\n sample.get(\"fileobject\").add_attribute(\"text\", hashlookuptext)\n samples[key][\"hashlookup\"][\"data\"] = result_json\n samples[key][\"hashlookup\"][\"markdown\"] = hashlookuptext\n misp.update_object(sample.get(\"fileobject\"))\n else:\n print(\"No results for \\033[91m{}\\033[90m.\".format(value))\nprint(\"Finished {}\".format(module_name))"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--5e6a08f4-39a2-4be9-a7cd-cf265fcd10f8"
},
"action--5e6a08f4-39a2-4be9-a7cd-cf265fcd10f8": {
"type": "action",
"name": "IN:4 MalwareBazaar",
"description": "## IN:4 MalwareBazaar\n\nNow we query [MalwareBazaar](https://bazaar.abuse.ch/). Similar as with VirusTotal, we use the MISP module to get useful attributes and objects. Afterwards we query MalwareBazaar directly to get more details such as vendor intel and tags.",
"commands": [
{
"type": "manual",
"command": "# Loop through the samples\n\nmodule_name = \"malwarebazaar\"\nprint(\"Lookup the file in {}\".format(module_name))\nfor key, sample in samples.items():\n attribute_type = \"sha256\"\n value = sample.get(\"sha256\")\n module_comment = \"From {} for {}\".format(module_name, key)\n data = {\n \"attribute\": {\n \"type\": f\"{attribute_type}\",\n \"uuid\": str(uuid.uuid4()),\n \"value\": f\"{value}\",\n },\n \"module\": module_name,\n }\n print(\"Query \\033[92m{}\\033[90m as \\033[92m{}\\033[90m\".format(value, attribute_type))\n result = requests.post(\"{}/query\".format(misp_modules_url), headers=misp_modules_headers, json=data)\n if \"results\" in result.json() and len(result.json()[\"results\"]) > 0:\n result_json = result.json()[\"results\"]\n for misp_attribute in result_json.get(\"Attribute\", []):\n misp_attribute[\"comment\"] = \"{}{}\".format(module_comment, misp_attribute.get(\"comment\", \"\"))\n created_attribute = misp.add_attribute(misp_event.uuid, misp_attribute, pythonify=True)\n if not \"errors\" in created_attribute:\n print(\" Got {} \\033[92m{}\\033[90m\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_attribute.uuid, \"related-to\"))\n else:\n print(\" Unable to add {} \\033[92m{}\\033[90m to MISP event\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n for misp_object in result_json.get(\"Object\", []):\n misp_object[\"comment\"] = \"{}{}\".format(module_comment, misp_object.get(\"comment\", \"\"))\n if len(misp_object[\"Attribute\"]) > 0:\n created_object = misp.add_object(misp_event.uuid, misp_object, pythonify=True)\n if not \"errors\" in created_object:\n print(\" Got \\033[92m{}\\033[90m \".format(misp_object[\"name\"]))\n misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_object.uuid, \"related-to\"))\n else:\n print(\" Unable to add \\033[92m{}\\033[90m to MISP event\".format(misp_object[\"name\"]))\n\n #################################################################\n\n print(\" Query MalwareBazaar directly to get file details\")\n url = \"https://mb-api.abuse.ch/api/v1/\"\n result = requests.post(url, data={\"query\": \"get_info\", \"hash\": value})\n if \"data\" in result.json() and len(result.json()[\"data\"]) > 0:\n result_json = result.json()[\"data\"][0]\n samples[key][\"malwarebazaar\"][\"markdown\"] = \"\"\n samples[key][\"malwarebazaar\"][\"data\"] = result_json\n samples[key][\"malwarebazaar\"][\"tags\"] = result_json.get(\"tags\", \"\")\n samples[key][\"malwarebazaar\"][\"signature\"] = result_json.get(\"signature\", \"\")\n\n if len(samples[key][\"malwarebazaar\"][\"tags\"]) > 0:\n samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n**Tags**\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"])\n for tag in samples[key][\"malwarebazaar\"][\"tags\"]:\n samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n - {}\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"], tag)\n\n if len(result_json.get(\"vendor_intel\", [])) > 0:\n if len(result_json.get(\"vendor_intel\", []).get(\"Triage\", [])) > 0:\n samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n**Triage tags and signatures**\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"])\n for tag in result_json.get(\"vendor_intel\", []).get(\"Triage\", []).get(\"tags\", []):\n samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n - {}\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"], tag)\n for signature in result_json.get(\"vendor_intel\", []).get(\"Triage\", []).get(\"signatures\", []):\n samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n - {}\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"], signature[\"signature\"])\n print(\" Finished query MalwareBazaar directly\")\n else:\n print(\"No results for \\033[91m{}\\033[90m.\".format(value))\nprint(\"Finished {}\".format(module_name))"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--c642b273-cfb6-42cc-aed5-00a8d6076e4c"
},
"action--c642b273-cfb6-42cc-aed5-00a8d6076e4c": {
"type": "action",
"name": "IN:5 PE imports and exports",
"description": "## IN:5 PE imports and exports\n\nNext we use `pefile` to analyse the **imports and exports**. The results are added as a **MISP report**.",
"commands": [
{
"type": "manual",
"command": "summary_pe = \"## PEFile summary\\n\\n\"\nprint(\"Using pefile\")\nfor key, sample in samples.items():\n summary_pe += \"## {}\\n\\n\".format(sample[\"path\"])\n\n try:\n pe = pefile.PE(sample[\"path\"])\n samples[key][\"pe\"][\"imports\"] = \"### Imports\\n\\n\"\n samples[key][\"pe\"][\"exports\"] = \"### Exports\\n\\n\"\n pe.parse_data_directories()\n\n print(\" Reading imports for {}\".format(sample[\"path\"]))\n try:\n for entry in pe.DIRECTORY_ENTRY_IMPORT:\n #print(\" {}\".format(entry.dll))\n samples[key][\"pe\"][\"imports\"] = \"{}\\n**{}**\\n\".format(samples[key][\"pe\"][\"imports\"], entry.dll)\n for imp in entry.imports:\n samples[key][\"pe\"][\"imports\"] = \"{}\\n - {} {}\\n\".format(samples[key][\"pe\"][\"imports\"], hex(imp.address), imp.name)\n except:\n print(\" No imports\")\n samples[key][\"pe\"][\"imports\"] = \"{}\\n *No imports*\\n\".format(samples[key][\"pe\"][\"imports\"])\n summary_pe += samples[key][\"pe\"][\"imports\"]\n\n print(\" Reading exports for {}\".format(sample[\"path\"]))\n try:\n for exp in pe.DIRECTORY_ENTRY_EXPORT.symbols:\n samples[key][\"pe\"][\"exports\"] = \"{}\\n - {} {} {}\\n\".format(samples[key][\"pe\"][\"exports\"], hex(pe.OPTIONAL_HEADER.ImageBase + exp.address), exp.name, exp.ordinal)\n except:\n print(\" No exports\")\n samples[key][\"pe\"][\"exports\"] = \"{}\\n *No exports*\\n\".format(samples[key][\"pe\"][\"exports\"])\n summary_pe += samples[key][\"pe\"][\"imports\"]\n except:\n print(\" \\033[91mPEFormatError\\033[90m for {}\\n\".format(sample[\"path\"]))\n summary_pe += \"\\n*PEFormatError*\\n\"\n\nevent_title = \"PEFile analysis\"\nprint(\"\\nCreating MISP report \\033[92m{}\\033[90m\".format(event_title))\nchunk_size = 61500\nfor i in range(0, len(summary_pe), chunk_size):\n chunk = summary_pe[i:i + chunk_size]\n event_report = MISPEventReport()\n event_title_edit = event_title\n if i > 0:\n event_title_edit = \"{} ({} > {})\".format(event_title, i, i + chunk_size)\n event_report.name = event_title_edit\n event_report.content = chunk\n result = misp.add_event_report(misp_event.id, event_report)\n if \"EventReport\" in result:\n print(\" Report ID: \\033[92m{}\\033[90m\".format(result.get(\"EventReport\", []).get(\"id\", 0)))\n else:\n print(\"Failed to create report for \\033[91m{}\\033[90m.\".format(event_title))"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--4f13756f-2808-440f-9188-850a8642fc37"
},
"action--4f13756f-2808-440f-9188-850a8642fc37": {
"type": "action",
"name": "IN:6 MISP report for investigation",
"description": "## IN:6 MISP report for investigation\n\nApart from the report on PE data, we also create a report on the previous investigation steps.",
"commands": [
{
"type": "manual",
"command": "summary_iv = \"## Investigation report\\n\\n\"\ncurrent_date = datetime.now()\nformatted_date = current_date.strftime(\"%Y-%m-%d\")\nfor key, sample in samples.items():\n summary_iv += \"## Analysis for {}\\n\".format(key)\n summary_iv += \"- Date: **{}**\\n\".format(formatted_date)\n summary_iv += \"- MISP event: **{}** ({})\\n\".format(misp_event.info, misp_event.id)\n summary_iv += \"### File details\\n\"\n summary_iv += \" - Path: {}\\n\".format(sample[\"path\"])\n summary_iv += \" - MD5: {}\\n\".format(sample[\"md5\"])\n summary_iv += \" - SHA256: {}\\n\".format(sample[\"sha256\"])\n summary_iv += \" - Entropy: {}\\n\".format(sample[\"entropy\"])\n summary_iv += \" - Mime-type: {}\\n\".format(sample[\"mimetype\"])\n summary_iv += \"### Risk\\n\"\n if sample.get(\"detection_ratio\", False):\n summary_iv += \" - Detection ratio: **{}**\\n\".format(sample[\"detection_ratio\"])\n summary_iv += \" - Detection ratio percentage: **{}**\\n\".format(sample[\"detection_ratio_perc\"])\n summary_iv += \" - Detection ratio indirect: {}\\n\".format(sample[\"detection_ratio_indirect\"])\n summary_iv += \" - Detection ratio indirect percentage: {}\\n\".format(sample[\"detection_ratio_indirect_perc\"])\n else:\n summary_iv += \"*No detection ratio found*\\n\\n\"\n if sample.get(\"risk_rating\", False):\n summary_iv += \" - Risk rating: **{}**\\n\".format(sample[\"risk_rating\"])\n summary_iv += \"### VirusTotal\\n\"\n if sample[\"vt\"].get(\"markdown\"):\n summary_iv += sample[\"vt\"][\"markdown\"]\n else:\n summary_iv += \"*No results*\\n\"\n summary_iv += \"\\n\\n\"\n summary_iv += \"### Hashlookup\\n\"\n if sample[\"hashlookup\"].get(\"markdown\"):\n summary_iv += sample[\"hashlookup\"][\"markdown\"]\n else:\n summary_iv += \"*No results*\\n\"\n summary_iv += \"\\n\\n\"\n summary_iv += \"### MalwareBazaar\\n\"\n if sample[\"malwarebazaar\"].get(\"markdown\"):\n summary_iv += sample[\"malwarebazaar\"][\"markdown\"]\n else:\n summary_iv += \"*No results*\\n\"\n summary_iv += \"\\n\\n\"\n\nevent_title = \"Malware analysis\"\nprint(\"Creating MISP report \\033[92m{}\\033[90m\".format(event_title))\nchunk_size = 61500\nfor i in range(0, len(summary_iv), chunk_size):\n chunk = summary_iv[i:i + chunk_size]\n event_report = MISPEventReport()\n event_title_edit = event_title\n if i > 0:\n event_title_edit = \"{} ({} > {})\".format(event_title, i, i + chunk_size)\n event_report.name = event_title_edit\n event_report.content = chunk\n result = misp.add_event_report(misp_event.id, event_report)\n if \"EventReport\" in result:\n print(\" Report ID: \\033[92m{}\\033[90m\".format(result.get(\"EventReport\", []).get(\"id\", 0)))\n else:\n print(\"Failed to create report for \\033[91m{}\\033[90m.\".format(event_title))"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--affc277a-00be-4c03-bc6c-0fb048352794"
},
"action--affc277a-00be-4c03-bc6c-0fb048352794": {
"type": "action",
"name": "CR:1 Correlation with MISP events",
"description": "# Correlation\n\n## CR:1 Correlation with MISP events\n\nThis cell searches the **MISP server** for events that have a match with the analysed samples.\n\nOnly **published** events (`correlation_published`) and attributes that have the **to_ids** flag (`correlation_to_ids`) set are take into account. There is a default limit of **1000 hits** (`correlation_limit`) and you can limit the search with tags (`correlation_match_tags`).",
"commands": [
{
"type": "manual",
"command": "# Only query for published MISP events\ncorrelation_published = False\n\n# Only consider those values that have the to_ids field set to True\ncorrelation_to_ids = True\n\n# Limit the returned results to 1000 attributes\ncorrelation_limit = 1000\n\n# Only return results corresponding with these tags\ncorrelation_match_tags = [\"tlp:amber\", \"tlp:white\"]\nprint(\"Search for correlating MISP events\")\n# Code block to query MISP and find the correlations\nfor key, sample in samples.items():\n value = [sample.get(\"sha256\"), sample.get(\"md5\")]\n search_match = misp.search(\"attributes\", to_ids=correlation_to_ids, value=value, tags=correlation_match_tags,\n published=correlation_published, limit=correlation_limit, pythonify=True)\n if len(search_match) > 0: \n for attribute in search_match:\n if attribute.Event.id != misp_event.id: # Skip the event we just created for this playbook\n print(\" Found \\033[92m{}\\033[90m in \\033[92m{}\\033[90m ({})\".format(attribute.value, attribute.Event.info, attribute.Event.id ))\n entry = {\"source\": \"MISP\", \"category\": attribute.category, \"type\": attribute.type, \"event_id\": attribute.Event.id, \"event_info\": attribute.Event.info}\n samples[key][\"MISP\"].append(entry)\nprint(\"Finished searching for correlations\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--795647c7-91ad-4e63-8a9e-0ce1c8ea089a"
},
"action--795647c7-91ad-4e63-8a9e-0ce1c8ea089a": {
"type": "action",
"name": "MISP events correlation table",
"description": "### MISP events correlation table\n\nThe correlation results are now stored in `samples`. Execute the next cell to display them in a table format. The table is also included in the summary sent to Mattermost and TheHive.",
"commands": [
{
"type": "manual",
"command": "# Put the correlations in a pretty table. We can use this table later also for the summary\ntable = PrettyTable()\ntable.field_names = [\"Source\", \"Value\", \"Category\", \"Type\", \"Event\", \"Event ID\"]\ntable.align[\"Value\"] = \"l\"\ntable.align[\"Category\"] = \"l\"\ntable.align[\"Type\"] = \"l\"\ntable.align[\"Event\"] = \"l\"\ntable.align[\"Event ID\"] = \"l\"\ntable._max_width = {\"Event\": 50}\nfor key, sample in samples.items():\n for match in sample[\"MISP\"]:\n table.add_row([match[\"source\"], key, match[\"category\"], match[\"type\"], match[\"event_info\"], match[\"event_id\"]])\nprint(table.get_string(sortby=\"Value\"))\ntable_mispevents = table"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--b54d8aa4-abda-43db-b2dd-dd6440ab9d2a"
},
"action--b54d8aa4-abda-43db-b2dd-dd6440ab9d2a": {
"type": "action",
"name": "CR:2 Correlation with MISP feeds",
"description": "## CR:2 Correlation with MISP feeds\n\nThis cell searches the **MISP feeds** for events that have a match with the analysed samples. The output of this cell is a table with all the matches. The output is also repeated at the end of the playbook.\n\nNote that the correlation lookup in the MISP feeds does not return the name of the MISP event, it returns the UUID of the event as title.",
"commands": [
{
"type": "manual",
"command": "print(\"Search in MISP feeds\")\nmisp_cache_url = \"{}/feeds/searchCaches/\".format(misp_url)\nmatch = False\nfor key, sample in samples.items():\n # Instead of GET, use POST (https://github.com/MISP/MISP/issues/7478)\n value = sample.get(\"sha256\")\n cache_results = requests.post(misp_cache_url, headers=misp_headers, verify=misp_verifycert, json={\"value\": value})\n for result in cache_results.json():\n if \"Feed\" in result:\n match = True\n print(\" Found \\033[92m{}\\033[90m in \\033[92m{}\\033[90m\".format(value, result[\"Feed\"][\"name\"]))\n for match in result[\"Feed\"][\"direct_urls\"]:\n entry = {\"source\": \"Feeds\", \"feed_name\": result[\"Feed\"][\"name\"], \"match_url\": match[\"url\"]}\n samples[key][\"Feeds\"].append(entry)\n\nprint(\"Finished searching in MISP feeds\")\nif not match:\n print(\"\\033[93mNo correlating information found in MISP feeds.\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--dd67e644-a1f0-4485-b551-ec61b19d8da7"
},
"action--dd67e644-a1f0-4485-b551-ec61b19d8da7": {
"type": "action",
"name": "MISP feed correlations table",
"description": "### MISP feed correlations table\n\nThe correlation results are now stored in `samples`. Execute the next cell to display them in a table format. The table is also included in the summary sent to Mattermost and TheHive.",
"commands": [
{
"type": "manual",
"command": "# Put the correlations in a pretty table. We can use this table later also for the summary\ntable = PrettyTable()\ntable.field_names = [\"Source\", \"Value\", \"Feed\", \"URL\"]\ntable.align[\"Value\"] = \"l\"\ntable.align[\"Feed\"] = \"l\"\ntable.align[\"Feed URL\"] = \"l\"\ntable._max_width = {\"Feed\": 50}\nfor key, sample in samples.items():\n for match in sample[\"Feeds\"]:\n table.add_row([match[\"source\"], key, match[\"feed_name\"], match[\"match_url\"]])\nprint(table.get_string(sortby=\"Value\"))\ntable_mispfeeds = table"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--ed98e018-ea5d-4acc-a292-875c9d3082ff"
},
"action--ed98e018-ea5d-4acc-a292-875c9d3082ff": {
"type": "action",
"name": "SA:1 Store in MWDB",
"description": "# Share the sample\n\n## SA:1 Store in MWDB\n\nThe Malware Repository MWDB, formerly known as Malwarecage, is a project from CERT.pl that is available as a service (via [MWDB-CERT.pl](https://mwdb.cert.pl/login) but you can also host its core component [MWDB](https://github.com/CERT-Polska/mwdb-core) on your own infrastructure. In this playbook there's the option to store the sample in your instance of MWDB. Future variants of the playbook include sharing the sample with well-known malware sandboxes.\n\nThere is a MISP module that allows to upload samples from MISP to MWDB, but in this case the playbook will interact directly with MWDB.",
"commands": [
{
"type": "manual",
"command": "mwdb_public = True\nmwdb_metakeys = False\nmwdb = MWDB(api_key=mwdb_apikey, api_url=mwdb_api_url)\nmwdb_tags = [\"misp\", \"playbook\"]\n\nsummary_mwdb = \"## Samples stored in MWDB\\n\"\nprint(\"Start sharing with MWDB\")\nfor key, sample in samples.items():\n sample_filename = key\n with open(sample[\"path\"], 'rb') as file:\n data = file.read()\n file_object = mwdb.upload_file(sample_filename, data, metakeys=mwdb_metakeys, public=mwdb_public)\n for tag in mwdb_tags:\n file_object.add_tag(tag)\n file_object.add_comment(\"Uploaded from MISP playbook for malware triage, via event {}\".format(misp_event.id))\n mwdb_link = \"{}{}\".format(mwdb_api_url.replace(\"/api\", \"/file/\"), file_object.md5)\n attribute = MISPAttribute()\n attribute.value = mwdb_link\n attribute.to_ids = False\n attribute.type = \"link\"\n attribute.disable_correlation = True\n attribute.comment = \"MWDB link for sample {}\".format(key)\n attribute_mwdb = misp.add_attribute(misp_event.uuid, attribute, pythonify=True)\n summary_mwdb += \" {} in [{}]({})\\n\".format(key, mwdb_link, mwdb_link)\n if not \"errors\" in attribute_mwdb:\n misp.add_object_reference(sample[\"fileobject\"].add_reference(attribute_mwdb.uuid, \"related-to\"))\n else:\n print(\" Unable to add attribute with link to MWDB to MISP\")\n print(\" Sample {} at \\033[92m{}\\033[90m\".format(key, mwdb_link))\nsummary_mwdb += \"\\n\"\nprint(\"Finished sharing\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--280b9d4b-4c55-41a3-b089-61122adfa43b"
},
"action--280b9d4b-4c55-41a3-b089-61122adfa43b": {
"type": "action",
"name": "EN:1 MISP indicators",
"description": "## EN:1 MISP indicators\n\nThe next section first **queries MISP for the indicators added to the MISP event** that is linked to the execution of this playbook.\n\nThe indicators are stored in the variable `indicator_table` (table format) and `indicator_raw_list` (in raw format) which is used in a later section to create the playbook summary.",
"commands": [
{
"type": "manual",
"command": "# Get all the indicators for our event and store this is in a table. We can also use this for the summary.\nindicator_search = misp.search(\"attributes\", uuid=misp_event.uuid, to_ids=True, pythonify=True)\nindicator_raw_list = []\nindicator_table = PrettyTable()\nif len(indicator_search) > 0:\n indicator_table.field_names = [\"Type\", \"Category\", \"Indicator\", \"Comment\"]\n indicator_table.align[\"Type\"] = \"l\"\n indicator_table.align[\"Category\"] = \"l\"\n indicator_table.align[\"Indicator\"] = \"l\"\n indicator_table.align[\"Comment\"] = \"l\"\n indicator_table.border = True\n for indicator in indicator_search:\n if indicator.value not in indicator_raw_list:\n comment = indicator.comment\n if hasattr(indicator, 'object_relation'):\n object_ind = misp.get_object(indicator.object_id, pythonify=True)\n comment = \"From '{}' object {} {}\".format(object_ind.name, object_ind.comment.lower(), comment.lower())\n indicator_table.add_row([indicator.type, indicator.category, indicator.value, comment])\n indicator_raw_list.append(indicator.value)\n print(\"Got \\033[92m{}\\033[90m indicator(s) from the event \\033[92m{}\\033[90m ({}).\\n\".format(len(indicator_raw_list), misp_event.info, misp_event.id))\nelse:\n print(\"\\033[93mNo indicators found in the event \\033[92m{}\\033[90m ({})\".format(misp_event.info, misp_event.id))"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--f1c3a54c-5906-4071-af9a-ffe96f03c463"
},
"action--f1c3a54c-5906-4071-af9a-ffe96f03c463": {
"type": "action",
"name": "Raw list of MISP indicators",
"description": "### Raw list of MISP indicators\n\nThe indicators are now stored in `indicator_search` (as Python objects) and `indicator_raw_list` (in raw format, only the indicators). Execute the next cell to display them in a table format. The table is also included in the summary sent to Mattermost and TheHive.",
"commands": [
{
"type": "manual",
"command": "if len(indicator_raw_list) > 0:\n print(indicator_table.get_string(sortby=\"Type\"))\n print(\"\\n\\nIndicator list in raw format:\")\n print(\"---------------------------------------------------\")\n for el in indicator_raw_list:\n print(\"{}\".format(el))\n print(\"---------------------------------------------------\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--0ff07d73-09ba-4bfe-a5f6-97038cae0ae6"
},
"action--0ff07d73-09ba-4bfe-a5f6-97038cae0ae6": {
"type": "action",
"name": "EN:2 Create the summary of the playbook",
"description": "## EN:2 Create the summary of the playbook\n\nThe next section creates a summary and stores the output in the variable `summary` in Markdown format. It also stores an intro text in the variable `intro`. These variables are later used when sending information to Mattermost or TheHive.",
"commands": [
{
"type": "manual",
"command": "summary = \"# MISP Playbook summary\\nMalware triage with MISP event: **{}** ({}/events/view/{}). \".format(misp_event.info, misp_url, misp_event.id)\n\nsummary += \"\\n\"\nsummary += summary_iv\nsummary += \"\\n\"\nintro = summary\n\nsummary += \"## Indicators\\n\\n\"\nsummary += \"### Indicators table\\n\\n\"\nif len(indicator_raw_list) > 0:\n indicator_table.set_style(MARKDOWN)\n summary += indicator_table.get_string(sortby=\"Type\")\n summary += \"\\n\\n\\n\" \n summary += \"### Indicators in **raw format**\\n\\n\"\n for indicator in indicator_raw_list:\n summary += \"{}\\n\\n\".format(indicator)\n summary += \"\\n\" \nelse:\n summary += \"There are no indicators\"\nsummary += \"\\n\\n\"\n\nsummary += \"## Correlations\\n\\n\"\nsummary += \"### MISP event matches\\n\\n\"\ntable_mispevents.set_style(MARKDOWN)\nsummary += table_mispevents.get_string()\nsummary += \"\\n\\n\"\n\nsummary += \"### MISP feed matches\\n\\n\"\ntable_mispfeeds.set_style(MARKDOWN)\nsummary += table_mispfeeds.get_string()\nsummary += \"\\n\\n\"\n\nsummary += summary_pe\n\nsummary += \"\\n\\n\"\n\nprint(\"The \\033[92msummary\\033[90m of the playbook is available.\\n\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--077c63d0-d508-4521-a137-d85e02977f20"
},
"action--077c63d0-d508-4521-a137-d85e02977f20": {
"type": "action",
"name": "EN:3 Print the input for malware triage",
"description": "## EN:3 Print the input for malware triage\n\nApart from the full summary of the investigation, the malware triage summary provides the necessary input for the analyst. This summary was previously also added as a MISP report.",
"commands": [
{
"type": "manual",
"command": "print(summary_iv)\n# Or print with parsed markdown\n# display_markdown(summary_iv, raw=True)"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--56946bb4-7f77-46c5-9418-2fb8c79a4e0d"
},
"action--56946bb4-7f77-46c5-9418-2fb8c79a4e0d": {
"type": "action",
"name": "EN:4 Send a summary to Mattermost",
"description": "## EN:4 Send a summary to Mattermost\n\nNow you can send the summary to Mattermost. You can send the summary in two ways by selecting one of the options for the variable `send_to_mattermost_option` in the next cell.\n\n- The default option where the entire summary is in the **chat**, or\n- a short intro and the summary in a **card**\n\nFor this playbook we rely on a webhook in Mattermost. You can add a webhook by choosing the gear icon in Mattermost, then choose Integrations and then **Incoming Webhooks**. Set a channel for the webhook and lock the webhook to this channel with *\"Lock to this channel\"*.",
"commands": [
{
"type": "manual",
"command": "send_to_mattermost_option = \"via a chat message\"\n#send_to_mattermost_option = \"via a chat message with card\"\nmessage = False\nif send_to_mattermost_option == \"via a chat message\":\n message = {\"username\": mattermost_playbook_user, \"text\": summary}\nelif send_to_mattermost_option == \"via a chat message with card\":\n message = {\"username\": mattermost_playbook_user, \"text\": intro, \"props\": {\"card\": summary}}\n\nif message:\n r = requests.post(mattermost_hook, data=json.dumps(message))\n r.raise_for_status()\nif message and r.status_code == 200:\n print(\"Summary is \\033[92msent to Mattermost.\\n\")\nelse:\n print(\"\\033[91mFailed to sent summary\\033[90m to Mattermost.\\n\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--86141b38-8865-4f63-ae9b-f551314c2ba9"
},
"action--86141b38-8865-4f63-ae9b-f551314c2ba9": {
"type": "action",
"name": "EN:5 End of the playbook",
"description": "## EN:5 End of the playbook",
"commands": [
{
"type": "manual",
"command": "print(\"\\033[92m End of the playbook\")"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "action--9f384216-61d0-4d16-aa6a-1f393092c870"
},
"action--9f384216-61d0-4d16-aa6a-1f393092c870": {
"type": "action",
"name": "MISP Playbook",
"description": "Copy of the MISP Playbook",
"commands": [
{
"type": "jupyter",
"command_b64": "{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "ca1fbfb5",
   "metadata": {
    "tags": [
     "playbook-introduction"
    ]
   },
   "source": [
    "# Malware triage with MISP\n",
    "\n",
    "## Introduction\n",
    "\n",
    "- UUID: **68b42f4d-8d5e-46c4-97f8-b87b9df210a3**\n",
    "- Started from [issue 2](https://github.com/MISP/misp-playbooks/issues/2)\n",
    "- State: **Published**\n",
    "- Purpose: A playbook to provide an analyst sufficient information to do basic malware triage on one or more samples.\n",
    "    - This playbook creates a **MISP event** for malware triage. \n",
    "    - Samples are **attached** to a MISP event (with file object relations).\n",
    "    - VirusTotal and MalwareBazaar are used to get the **detection rate**, **threat classification** and **sandbox** information.\n",
    "    - Hashlookup is used to check for **known hashes**.\n",
    "    - PEfile analysis is done for **imports** and **exports**.\n",
    "    - The results are stored in **MISP reports** and as MISP objects where relevant.\n",
    "    - Correlations with MISP events or data feeds are added to a summary.\n",
    "    - The sample is shared with a local instance of **MWDBcore**.\n",
    "    - The summary is then sent to Mattermost.\n",
    "- Tags: [ \"malware\", \"triage\", \"incidentresponse\", \"ir\", \"dfir\" ]\n",
    "- External resources: **VirusTotal, Hashlookup, MalwareBazaar, MWDB, Mattermost**\n",
    "- Target audience: **SOC**, **CSIRT**, **CTI**\n",
    "- Notes:\n",
    "    - Samples are stored in a \"malwarezoo\", defined in the variable `malwarezoo`.\n",
    "    - All samples that do not have an extension \"do-not-run\" (set in `defangsuffix`) are considered as unprocessed.\n",
    "    - The playbook adds the defangsuffix when it processes the samples.\n",
    "    - For VirusTotal and MalwareBazaar, both the MISP modules as well as direct queries to VT/MB are used."
   ]
  },
  {
   "cell_type": "markdown",
   "id": "a512a2c1",
   "metadata": {
    "jp-MarkdownHeadingCollapsed": true,
    "tags": []
   },
   "source": [
    "# Playbook\n",
    "\n",
    "- **Playbook title**\n",
    "- - Introduction\n",
    "- **Preparation**\n",
    "    - PR:1 Initialise environment\n",
    "    - PR:2 Verify MISP modules\n",
    "    - PR:3 Load helper functions\n",
    "    - PR:4 Set helper variables\n",
    "    - P5:5 Load and defang samples\n",
    "    - PR:6 MISP event details\n",
    "    - PR:7 Setup MISP event link\n",
    "- **Investigate**\n",
    "    - IN:1 File analysis\n",
    "    - IN:2 VirusTotal results\n",
    "    - IN:3 Hashlookup\n",
    "    - IN:4 MalwareBazaar\n",
    "    - IN:5 PE imports and exports\n",
    "    - IN:6 MISP report for investigation\n",
    "- **Correlation**\n",
    "    - CR:1 Correlation with MISP events\n",
    "    - CR:2 Correlation with MISP feeds\n",
    "- **Share the sample**\n",
    "    - SA:1 Store in MWDB\n",
    "- **Summary**\n",
    "    - EN:1 MISP indicators\n",
    "    - EN:2 Create the summary of the playbook\n",
    "    - EN:3 Print the input for malware triage\n",
    "    - EN:4 Send a summary to Mattermost\n",
    "    - EN:5 End of the playbook\n",
    "    - External references\n",
    "    - Technical details"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "8dc24913",
   "metadata": {
    "tags": []
   },
   "source": [
    "# Preparation\n",
    "\n",
    "## PR:1 Initialise environment\n",
    "\n",
    "This section **initialises the playbook environment** and loads the required Python libraries. \n",
    "\n",
    "The credentials for MISP (**API key**) and other services are loaded from the file `keys.py` in the directory **vault**. A [PyMISP](https://github.com/MISP/PyMISP) object is created to interact with MISP and the active MISP server is displayed. By printing out the server name you know that it's possible to connect to MISP. In case of a problem PyMISP will indicate the error with `PyMISPError: Unable to connect to MISP`.\n",
    "\n",
    "The contents of the `keys.py` file should contain at least :\n",
    "\n",
    "```\n",
    "misp_url=\"<MISP URL>\"                  # The URL to our MISP server\n",
    "misp_key=\"<MISP API KEY>\"              # The MISP API key\n",
    "misp_verifycert=<True or False>        # Indicate if PyMISP should attempt to verify the certificate or ignore errors\n",
    "```"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "5f94c336",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Initialise Python environment\n",
    "import urllib3\n",
    "import sys\n",
    "import json\n",
    "from pyfaup.faup import Faup\n",
    "from prettytable import PrettyTable, MARKDOWN\n",
    "from IPython.display import Image, display, display_markdown, HTML\n",
    "from datetime import date\n",
    "import requests\n",
    "import uuid\n",
    "#from uuid import uuid4\n",
    "from pymisp import *\n",
    "from pymisp.tools import GenericObjectGenerator\n",
    "from pymisp.tools import make_binary_objects\n",
    "import logging\n",
    "\n",
    "import os\n",
    "import time\n",
    "import pefile\n",
    "from datetime import datetime\n",
    "from mwdblib import MWDB\n",
    "\n",
    "# Load the credentials\n",
    "sys.path.insert(0, \"../vault/\")\n",
    "from keys import *\n",
    "if misp_verifycert is False:\n",
    "    import urllib3\n",
    "    urllib3.disable_warnings(urllib3.exceptions.InsecureRequestWarning)\n",
    "print(\"The \\033[92mPython libraries\\033[90m are loaded and the \\033[92mcredentials\\033[90m are read from the keys file.\")\n",
    "\n",
    "# Create the PyMISP object\n",
    "misp = PyMISP(misp_url, misp_key, misp_verifycert)\n",
    "print(\"I will use the MISP server \\033[92m{}\\033[90m for this playbook.\\n\\n\".format(misp_url))\n",
    "\n",
    "# Create headers for specific cases\n",
    "misp_headers = {\"Authorization\": misp_key,  \"Content-Type\": \"application/json\", \"Accept\": \"application/json\"}"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "342cb027-5ca6-4e01-b53d-1625ffc9531f",
   "metadata": {},
   "source": [
    "## PR:2 Verify MISP modules\n",
    "\n",
    "This playbook uses the MISP modules to obtain additional correlation or enrichment information. [MISP modules](https://github.com/MISP/misp-modules) are autonomous modules that can be used to extend MISP for new services such as expansion, import and export. The modules are written in Python 3 following a simple API interface. The objective is to ease the extensions of MISP functionalities without modifying core components. The API is available via a simple REST API which is independent from MISP installation or configuration.\n",
    "\n",
    "In the next cell we check if we have access to the **MISP module** server and if the required modules are enabled."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "20953103-2dab-43c7-b423-39d9bdc9ae17",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Where can we find the local MISP Module server? You can leave this to the default setting in most cases.\n",
    "misp_modules_url = \"http://127.0.0.1:6666\"\n",
    "\n",
    "# How long do we wait between queries when using the MISP modules (API rate limiting of external service such as VirusTotal)\n",
    "misp_modules_wait = 3\n",
    "\n",
    "# Initiliasation\n",
    "misp_modules = {}\n",
    "misp_modules_headers = {\n",
    "    \"Content-Type\": \"application/json\",\n",
    "    \"Accept\": \"application/json\"\n",
    "}\n",
    "misp_modules_in_use = [\"hashlookup\", \"virustotal_public\", \"malwarebazaar\"]\n",
    "# Code block to query the MISP module server and check if our modules are enabled\n",
    "res = requests.get(\"{}/modules\".format(misp_modules_url), headers=misp_modules_headers)\n",
    "for module in res.json():\n",
    "    for module_requested in misp_modules_in_use:\n",
    "        if module.get(\"name\", False) == module_requested:\n",
    "            misp_modules[module_requested] = {\"enabled\": True, \"input\": module.get(\"mispattributes\").get(\"input\")}\n",
    "            print(\"Found the \\033[92m{}\\033[90m MISP module (Accepted input: {}).\".format(module_requested, misp_modules[module_requested][\"input\"]))\n",
    "print(\"\\n\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "840e7fc1-8b21-4005-bf7c-9ccd65a7d166",
   "metadata": {},
   "source": [
    "## PR:3 Load helper functions\n",
    "\n",
    "The next cell contains **helper functions** that are used in this playbook. \n",
    "\n",
    "Instead of distributing helper functions as separate Python files this playbook includes all the required code as one code cell. This makes portability of playbooks between instances easier. The downside is that functions defined in this playbook need to be defined again in other playbooks, which is not optimal for code re-use. For this iteration of playbooks it is chosen to include the code in the playbook (more portability), but you can easily create one \"helper\" file that contains all the helper code and then import that file in each playbook (for example by adding to the previous cell `from helpers import *`). Note that the graphical workflow image is included as an external image. A missing image would not influence the further progress of the playbook."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "e015944d-ef4f-4e37-8ec3-b01950f6ee76",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "def pb_get_misp_tags(tags=[], local_tags=[]):\n",
    "    '''\n",
    "    Get a list of MISP tags based on a Python list\n",
    "\n",
    "    :param misp: MISP object\n",
    "    :param object_template: which object template to return\n",
    "    '''\n",
    "    misp_tags = []\n",
    "    for el in tags:\n",
    "        t = MISPTag()\n",
    "        t.name = el\n",
    "        t.local = False\n",
    "        misp_tags.append(t)\n",
    "\n",
    "    for el in local_tags:\n",
    "        t = MISPTag()\n",
    "        t.name = el\n",
    "        t.local = True\n",
    "        misp_tags.append(t)\n",
    "    return misp_tags\n",
    "\n",
    "\n",
    "def calculate_risk_score(ratio):\n",
    "    '''\n",
    "    Calculate risk score and return the score and a tag\n",
    "    '''\n",
    "    detection_ratio_perc = 0\n",
    "    score = 0\n",
    "    base = 0\n",
    "    tag = \"\"\n",
    "    if len(ratio) > 0:\n",
    "        for detection_ratio in ratio:\n",
    "            score += int(detection_ratio.split(\"/\")[0])\n",
    "            base += int(detection_ratio.split(\"/\")[1])\n",
    "        if base > 0:\n",
    "            detection_ratio_perc = round((score / base) * 100, 2)\n",
    "            if detection_ratio_perc > 0:\n",
    "                if detection_ratio_perc >= 75:\n",
    "                    tag = \"misp:threat-level=\\\"high-risk\\\"\"\n",
    "                elif detection_ratio_perc >= 50:\n",
    "                    tag = \"misp:threat-level=\\\"medium-risk\\\"\"\n",
    "                #elif detection_ratio_perc >= 25:\n",
    "                else:\n",
    "                    tag = \"misp:threat-level=\\\"low-risk\\\"\"\n",
    "            else:\n",
    "                tag = \"misp:threat-level=\\\"no-risk\\\"\"\n",
    "    return detection_ratio_perc, tag"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "6b0fe9f4-23b1-4eec-bc61-8bc965ffc619",
   "metadata": {},
   "source": [
    "## PR:4 Set helper variables\n",
    "\n",
    "This cell contains **helper variables** that are used in this playbook. Their usage is explained in the next steps of the playbook.\n",
    "\n",
    "- `samples` : a dictionary of the objects that are created when the playbook progresses\n",
    "- `malwarezoo` : the location where the samples are uploaded\n",
    "- `defangsuffix` : the suffix to add when defanging the samples"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "722141bd-7ab2-478b-9615-88d628addd99",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Dictionary to store playbook results\n",
    "samples = {}\n",
    "\n",
    "# Location to upload samples\n",
    "malwarezoo = \"/data/notebook-demo/playbook/notebooks/evidence\"\n",
    "\n",
    "# Suffix to add when defanging the samples\n",
    "defangsuffix = \".do-not-run\"\n",
    "\n",
    "# Helper variables\n",
    "misp_event = False\n",
    "summary = \"\""
   ]
  },
  {
   "cell_type": "markdown",
   "id": "8fac1eb3-7c93-4bae-8700-363236e7a61d",
   "metadata": {},
   "source": [
    "## PR:5 Load and defang samples\n",
    "\n",
    "This section checks if there are unprocessed samples in `malwarezoo` and **loads these samples** in the dictionary `samples`. It will also defang the sample to prevent accidential execution. A sample with the `defangsuffix` suffix is considered as processed."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "cbe27164-b719-4d5c-888c-0d47e1a9f568",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Check if the malware samples directory exists and misp modules are enabled\n",
    "malwarezoo_errors = False\n",
    "\n",
    "malwarezoo_samples = [file for file in os.listdir(malwarezoo) if os.path.isfile(os.path.join(malwarezoo, file))]\n",
    "if len(malwarezoo_samples) == 0:\n",
    "    print(\"There are no files in \\033[91m{}\\033[90m.\".format(malwarezoo))\n",
    "    print(\"\\033[91mUnable to proceed!\\033[90m\")\n",
    "else:\n",
    "    # Defang the samples before we continue\n",
    "    print(\"Defang samples\")\n",
    "    for sample in malwarezoo_samples:\n",
    "        if defangsuffix not in sample:\n",
    "            sample_path = \"{}/{}{}\".format(malwarezoo, sample, defangsuffix)\n",
    "            sample_orig_path = \"{}/{}\".format(malwarezoo, sample)\n",
    "            os.system(\"mv {} {}\".format(sample_orig_path, sample_path))\n",
    "            samples[sample] = {\"path\": sample_path, \"orig_path\": sample_orig_path}\n",
    "            samples[sample][\"MISP\"] = []\n",
    "            samples[sample][\"Feeds\"] = []\n",
    "            samples[sample][\"detection_ratio\"] = []\n",
    "            samples[sample][\"detection_ratio_indirect\"] = []\n",
    "            samples[sample][\"vt\"] = {}\n",
    "            samples[sample][\"malwarebazaar\"] = {}\n",
    "            samples[sample][\"hashlookup\"] = {}\n",
    "            samples[sample][\"pe\"] = {}\n",
    "            print(\" Renamed {} to \\033[92m{}\\033[90m\".format(sample, sample_path))\n",
    "        else:\n",
    "            print(\" Skip \\033[91m{}\\033[90m, already processed (defanged).\".format(sample))\n",
    "    if len(samples) > 0:\n",
    "        print(\"\\n\\033[92mContinue\\033[90m the playbook with \\033[92m{}\\033[90m samples.\\n\".format(len(samples)))\n",
    "    else:\n",
    "        print(\"All samples in the malwarezoo ({}) are \\033[91malready processed\\033[90m (defanged).\\n\".format(malwarezoo))"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "9d750311-fc87-45e2-b3f2-88a1603f3dfd",
   "metadata": {},
   "source": [
    "## PR:6 MISP event details\n",
    "\n",
    "### Event title\n",
    "\n",
    "In this playbook we create a **new** MISP event with title **Malware triage for _malwarelist_**. You get the chance to override this default title but remember that it is good practice to choose a self-explanatory **event title**. This event title is shown in the MISP event index and should provide you the necessary information what the event is about. You should avoid using generic event titles. Read the [Best Practices in Threat Intelligence](https://www.misp-project.org/best-practices-in-threat-intelligence.html) for further guidance.\n",
    "\n",
    "### Contexualisation\n",
    "\n",
    "This playbook adds event contexualisation via the **tags** that are defined in `event_additional_global_tags` (for *global* tags) and `event_additional_local_tags` (for *local* tags). As a reminder, whereas *global* tags remain attached to the events that you share with your community, the *local* tags are not shared outside your organisation. It's also a good idea to primarily use tags that are part of a [taxonomy](https://github.com/MISP/misp-taxonomies), this allows you to make the contexualisation more portable accross multiple MISP instances.\n",
    "\n",
    "In this playbook the list of tags is build via one of the helper functions `pb_get_misp_tags`. This function takes two arguments, first a list of tags to convert as *global* tags, and secondly a list of tags to convert as *local* tags. It then returns a Python list of MISPTag objects.\n",
    "\n",
    "### Traffic Light Protocol\n",
    "\n",
    "The default **TLP** for this event is **<span style='color:#FFBF00'>tlp:amber</span>**. The Traffic Light Protocol (TLP) facilitates sharing of potentially sensitive information and allows for more effective collaboration. TLP is a set of four standard labels to indicate the sharing boundaries to be applied by the recipients. TLP is always set by the creator of information. You can find more information at [FIRST](https://www.first.org/tlp/). You can specify the TLP via `event_tlp`.\n",
    "\n",
    "### MISP galaxies\n",
    "\n",
    "This playbook can also add MISP galaxies to the event with the variable `event_galaxies`. You can also leave the list empty if you do not want to add galaxies in this stage of the investigation.\n",
    "\n",
    "### MISP distribution, threat level and analysis level\n",
    "\n",
    "Optionally you can specifiy a MISP **distribution** (with `event_distribution`), **threat level** (with `event_threat_level_id`) or **analysis state** (with `event_analysis`). The event **date** is set to today via `event_date`.\n",
    "\n",
    "If you cannot remember the options for distribution, threat level or the analysis state then use the next cell to guide you. This cell is set as **raw**. If you **change its type to code** and execute the cell you get an overview of the options available for creating a MISP event."
   ]
  },
  {
   "cell_type": "raw",
   "id": "a7d5fa25-0e22-4fd0-83d9-f25e9ceec056",
   "metadata": {
    "tags": []
   },
   "source": [
    "# This cell is 'raw' by default. Change it to 'code' to execute it.\n",
    "\n",
    "print(\"Distribution value\")\n",
    "for e in Distribution:\n",
    "    print(\" \", e.value, \"=\", e)\n",
    "print(\"\\n\")\n",
    "print(\"Threat level\")\n",
    "for e in ThreatLevel:\n",
    "    print(\" \", e.value, \"=\", e)\n",
    "print(\"\\n\")\n",
    "print(\"Analysis level\")\n",
    "for e in Analysis:\n",
    "    print(\" \", e.value, \"=\", e)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "17bc1f98-6996-4527-bf28-b3bcdd6c4bc8",
   "metadata": {},
   "source": [
    "## PR:7 Setup MISP event link\n",
    "\n",
    "By default the playbook will generate a **title** with a prefix and the malware you want to investigate. You can override this event title with the variable `event_title`. If you leave this value empty the playbook will generate the MISP event title for you."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "9bae3e5c-4d94-440e-866e-ec2717f2050c",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Provide the event title for a new event. Leave blank for the playbook to auto generate one\n",
    "event_title = \"\"\n",
    "\n",
    "# Prefix for auto generate event title\n",
    "event_title_default_prefix = \"Malware triage\"\n",
    "\n",
    "# Optionally, you can change TLP, add additional event (local and global) tags, threatlevel, analysis state or distribution level\n",
    "event_tlp = \"tlp:amber\"\n",
    "\n",
    "# Event context\n",
    "event_additional_global_tags = []                                 # This needs to be a Python list\n",
    "event_additional_local_tags = [\"workflow:state=\\\"incomplete\\\"\"]   # This needs to be a Python list\n",
    "\n",
    "# Event galaxies\n",
    "event_galaxies = [ \"misp-galaxy:mitre-attack-pattern=\\\"Malware - T1587.001\\\"\", \"misp-galaxy:mitre-attack-pattern=\\\"Malware - T1588.001\\\"\"]\n",
    "\n",
    "# Additional MISP event settings\n",
    "event_threat_level_id = ThreatLevel.low\n",
    "event_analysis = Analysis.ongoing\n",
    "event_distribution = Distribution.your_organisation_only\n",
    "event_date = date.today()"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "8896ef17-733f-43d7-bf8b-49a70f68340a",
   "metadata": {},
   "source": [
    "### Create MISP event\n",
    "\n",
    "The next code cell will **create the MISP event** and store the references to the newly created event in the variable `misp_event`. This variable is used further when the playbook progresses."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "3009f8b3-0588-423e-a7be-614dfbc5dc2a",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Code block to create the event or add data to an existing event\n",
    "event_title = event_title.strip()\n",
    "\n",
    "if not(len(event_title) > 0):\n",
    "    query_sample = \"\"\n",
    "    for key, sample in samples.items():\n",
    "        query_sample = \"{} {}\".format(key, query_sample)\n",
    "    event_title = \"{} for {}\".format(event_title_default_prefix, query_sample)\n",
    "\n",
    "# Construct the event tags\n",
    "event_additional_global_tags.append(event_tlp)\n",
    "if len(event_galaxies) > 0:\n",
    "    event_additional_global_tags.append(event_galaxies)\n",
    "event_tags = pb_get_misp_tags(event_additional_global_tags, event_additional_local_tags)\n",
    "\n",
    "# Create the PyMISP object for an event\n",
    "event = MISPEvent()\n",
    "event.info = event_title\n",
    "event.distribution = event_distribution\n",
    "event.threat_level_id = event_threat_level_id\n",
    "event.analysis = event_analysis\n",
    "event.set_date(event_date)\n",
    "\n",
    "# Create the MISP event on the server side\n",
    "misp_event = misp.add_event(event, pythonify=True)\n",
    "print(\"Continue the playbook with the new \\033[92mcreated\\033[90m MISP event ID {} with title \\033[92m{}\\033[90m and UUID {}\".format(misp_event.id, misp_event.info, misp_event.uuid))\n",
    "for tag in event_tags:\n",
    "    if len(tag) > 0:\n",
    "        misp.tag(misp_event.uuid, tag, local=tag.local)\n",
    "        print(\"\\033[92mAdded\\033[90m event tag {}\".format(tag))\n",
    "print(\"\\n\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "7ae6a36f-91bc-41d6-ad13-058a2a3f9b45",
   "metadata": {},
   "source": [
    "# Investigate\n",
    "\n",
    "This section starts the investigation for the malware triage.\n",
    "\n",
    "## IN:1 File analysis\n",
    "\n",
    "The first step **uploads the samples** to the MISP event and uses the **advanced extraction** features to identify useful elements. This creates a set of related objects in the event The results are added to the `samples` dictionary.\n",
    "\n",
    "The section temporarily sets the logging level of MISP to *error* so that it does not stop execution when errors occur when analysing specific sections of the samples."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "8517d47b-7017-4945-9b6a-3a7b9389c112",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Avoid getting PyMISP errors. We reset it to the original state afterwards\n",
    "current_logging = logging.getLogger(\"pymisp\").getEffectiveLevel()\n",
    "logging.getLogger(\"pymisp\").setLevel(logging.ERROR)\n",
    "\n",
    "# Loop through the samples\n",
    "print(\"Processing samples and attaching them to an event\")\n",
    "for key, sample in samples.items():\n",
    "    attachment = sample.get(\"path\")\n",
    "    try:\n",
    "        print(\" Working on %s\" % attachment)\n",
    "        fo, peo, seos = make_binary_objects(attachment)\n",
    "    except Exception:\n",
    "        continue\n",
    "\n",
    "    if seos:\n",
    "        print(\"  PESectionObject\")\n",
    "        for s in seos:\n",
    "            r = misp.add_object(misp_event.uuid, s)\n",
    "\n",
    "    if peo:\n",
    "        print(\"  PEObject\")\n",
    "        if hasattr(peo, 'certificates') and hasattr(peo, 'signers'):\n",
    "            for c in peo.certificates:\n",
    "                misp.add_object(misp_event.uuid, c, pythonify=True)\n",
    "            for s in peo.signers:\n",
    "                misp.add_object(misp_event.uuid, s, pythonify=True)\n",
    "            del peo.certificates\n",
    "            del peo.signers\n",
    "        del peo.sections\n",
    "        r = misp.add_object(misp_event.uuid, peo, pythonify=True)\n",
    "        for ref in peo.ObjectReference:\n",
    "            r = misp.add_object_reference(ref)\n",
    "\n",
    "    if fo:\n",
    "        print(\"  FileObject\")\n",
    "        response = misp.add_object(misp_event.uuid, fo, pythonify=True)\n",
    "        for ref in fo.ObjectReference:\n",
    "            r = misp.add_object_reference(ref)\n",
    "        samples[key][\"fileobject\"] = response\n",
    "        samples[key][\"fileobject_uuid\"] = response.uuid\n",
    "        print(\"   Added file object with UUID \\033[92m{}\\033[90m\".format(response.uuid))\n",
    "        for attribute in response.attributes:\n",
    "            if attribute.object_relation == \"sha256\":\n",
    "                samples[key][\"sha256\"] = attribute.value\n",
    "                samples[key][\"sha256_uuid\"] = attribute.uuid\n",
    "            if attribute.object_relation == \"md5\":\n",
    "                samples[key][\"md5\"] = attribute.value\n",
    "            if attribute.object_relation == \"malware-sample\":\n",
    "                samples[key][\"malware-sample_uuid\"] = attribute.uuid\n",
    "            if attribute.object_relation == \"entropy\":\n",
    "                samples[key][\"entropy\"] = attribute.value\n",
    "            if attribute.object_relation == \"mimetype\":\n",
    "                samples[key][\"mimetype\"] = attribute.value\n",
    "        print(\"    MD5: \\033[92m{}\\033[90m\\n    SHA256: \\033[92m{}\\033[90m\".format(samples[key][\"md5\"], samples[key][\"sha256\"]))\n",
    "\n",
    "logging.getLogger(\"pymisp\").setLevel(current_logging)\n",
    "print(\"Finished processing samples.\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "628a7134-66f9-454f-b4fb-241f2e3d97b3",
   "metadata": {},
   "source": [
    "## IN:2 VirusTotal results\n",
    "\n",
    "Next we use [VirusTotal](https://www.virustotal.com/gui/). First we use the MISP modules to query VirusTotal. The advantage of using the MISP modules is that it returns MISP attributes and objects that can then be easily added to the event.\n",
    "\n",
    "After using the modules, we query VirusTotal directly (without using the module) to get the **threat classification**, **sandbox** results and the **Sigma** rules that triggered detection.\n",
    "\n",
    "Note that the sample is **not** submitted to VirusTotal for analysis."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "5afafe0d-87b2-413f-8ee8-cdcff7fb3e90",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Loop through the samples\n",
    "\n",
    "# Use the modules to create MISP objects\n",
    "# Then do a second query to get the file details\n",
    "\n",
    "scan_count = 0\n",
    "module_name = \"virustotal_public\"\n",
    "headers = {\n",
    "    \"accept\": \"application/json\",\n",
    "    \"x-apikey\": virustotal_apikey\n",
    "}\n",
    "\n",
    "print(\"Verifying samples with {}\".format(module_name))\n",
    "for key, sample in samples.items():\n",
    "    attribute_type = \"sha256\"\n",
    "    value = sample.get(\"sha256\")\n",
    "    module_comment = \"From {} for {}\".format(module_name, key)\n",
    "    data = {\n",
    "        \"attribute\": {\n",
    "            \"type\": f\"{attribute_type}\",\n",
    "            \"uuid\": str(uuid.uuid4()),\n",
    "            \"value\": f\"{value}\",\n",
    "        },\n",
    "        \"module\": module_name,\n",
    "        \"config\": {\"apikey\": virustotal_apikey}\n",
    "    }\n",
    "    print(\"Query \\033[92m{}\\033[90m as \\033[92m{}\\033[90m\".format(value, attribute_type))\n",
    "    result = requests.post(\"{}/query\".format(misp_modules_url), headers=misp_modules_headers, json=data)\n",
    "    if \"results\" in result.json() and len(result.json()[\"results\"]) > 0:\n",
    "        result_json = result.json()[\"results\"]\n",
    "\n",
    "        for misp_attribute in result_json.get(\"Attribute\", []):\n",
    "            misp_attribute[\"comment\"] = \"{}{}\".format(module_comment, misp_attribute.get(\"comment\", \"\"))\n",
    "            created_attribute = misp.add_attribute(misp_event.uuid, misp_attribute, pythonify=True)\n",
    "            if not \"errors\" in created_attribute:\n",
    "                print(\" Got {} \\033[92m{}\\033[90m\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n",
    "                misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_attribute.uuid, \"related-to\"))\n",
    "            else:\n",
    "                print(\" Unable to add {} \\033[92m{}\\033[90m to MISP event\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n",
    "        for misp_object in result_json.get(\"Object\", []):\n",
    "            misp_object[\"comment\"] = \"{}{}\".format(module_comment, misp_object.get(\"comment\", \"\"))\n",
    "            if len(misp_object[\"Attribute\"]) > 0:\n",
    "                created_object = misp.add_object(misp_event.uuid, misp_object, pythonify=True)\n",
    "                if not \"errors\" in created_object:\n",
    "                    print(\" Got \\033[92m{}\\033[90m \".format(misp_object[\"name\"]))\n",
    "                    misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_object.uuid, \"related-to\"))\n",
    "                    if misp_object[\"name\"] == \"virustotal-report\":\n",
    "                        direct_vt_report = False\n",
    "                        detection_ratio = False\n",
    "                        for attribute in misp_object.get(\"Attribute\", []):\n",
    "                            if attribute.get(\"object_relation\") == \"permalink\" and value in attribute.get(\"value\"):\n",
    "                                direct_vt_report = True\n",
    "                            if attribute.get(\"object_relation\") == \"detection-ratio\":\n",
    "                                detection_ratio = attribute.get(\"value\")\n",
    "                        if detection_ratio:\n",
    "                            if direct_vt_report:\n",
    "                                samples[key][\"detection_ratio\"].append(detection_ratio)\n",
    "                                print(\" Got detection ratio \\033[92m{}\\033[90m \".format(detection_ratio))\n",
    "                            else:\n",
    "                                samples[key][\"detection_ratio_indirect\"].append(detection_ratio)\n",
    "                                print(\" Got indirect detection ratio \\033[92m{}\\033[90m \".format(detection_ratio))\n",
    "                else:\n",
    "                    print(\" Unable to add \\033[92m{}\\033[90m to MISP event\".format(misp_object[\"name\"]))\n",
    "\n",
    "        risk1, tag1 = calculate_risk_score(sample.get(\"detection_ratio\"))\n",
    "        risk2, tag2 = calculate_risk_score(sample.get(\"detection_ratio_indirect\"))\n",
    "        if len(tag1) > 0:\n",
    "            misp.tag(sample.get(\"sha256_uuid\"), tag1)\n",
    "            misp.tag(sample.get(\"malware-sample_uuid\"), tag1)\n",
    "\n",
    "        risk_rating = \"vt-detection-ratio={}\\nvt-detection-ratio-indirect={}\".format(risk1, risk2)\n",
    "        samples[key][\"detection_ratio_perc\"] = risk1\n",
    "        samples[key][\"detection_ratio_indirect_perc\"] = risk2\n",
    "        samples[key][\"risk_rating\"] = risk_rating\n",
    "        sample.get(\"fileobject\").add_attribute(\"text\", risk_rating)\n",
    "        misp.update_object(sample.get(\"fileobject\"))\n",
    "\n",
    "        #################################################################\n",
    "\n",
    "        print(\" Query VirusTotal directly to get file details\")\n",
    "        vt_url = \"https://www.virustotal.com/api/v3/files/{}\".format(value)\n",
    "        result = requests.get(vt_url, headers=headers)\n",
    "        if \"data\" in result.json() and len(result.json()[\"data\"]) > 0:\n",
    "            result_json = result.json()[\"data\"][\"attributes\"]\n",
    "            samples[key][\"vt\"][\"data\"] = result_json\n",
    "            samples[key][\"vt\"][\"popular_threat_classification\"] = result_json.get(\"popular_threat_classification\", [])\n",
    "            samples[key][\"vt\"][\"sigma_analysis_results\"] = result_json.get(\"sigma_analysis_results\", [])\n",
    "            samples[key][\"vt\"][\"sandbox_verdicts\"] = result_json.get(\"sandbox_verdicts\", [])\n",
    "            samples[key][\"vt\"][\"sandboxes\"] = []\n",
    "            samples[key][\"vt\"][\"markdown\"] = \"\"\n",
    "\n",
    "            if len(samples[key][\"vt\"][\"popular_threat_classification\"]) > 0:\n",
    "                samples[key][\"vt\"][\"markdown\"] = \"#### Popular threat classifications\\n\"\n",
    "                for el in samples[key][\"vt\"][\"popular_threat_classification\"]:\n",
    "                    if el == \"suggested_threat_label\":\n",
    "                        samples[key][\"vt\"][\"suggested_threat_label\"] = samples[key][\"vt\"][\"popular_threat_classification\"][el]\n",
    "                    samples[key][\"vt\"][\"markdown\"] = \"{}\\n- {}: {}\".format(samples[key][\"vt\"][\"markdown\"], el, samples[key][\"vt\"][\"popular_threat_classification\"][el])\n",
    "\n",
    "            if len(samples[key][\"vt\"][\"sigma_analysis_results\"]) > 0:\n",
    "                samples[key][\"vt\"][\"markdown\"] = \"{}\\n\\n#### Sigma analysis results\\n\".format(samples[key][\"vt\"][\"markdown\"])\n",
    "                for el in samples[key][\"vt\"][\"sigma_analysis_results\"]:\n",
    "                    samples[key][\"vt\"][\"markdown\"] = \"{}\\n - {}\".format(samples[key][\"vt\"][\"markdown\"], el[\"rule_title\"])\n",
    "\n",
    "            if len(samples[key][\"vt\"][\"sandbox_verdicts\"]) > 0:\n",
    "                samples[key][\"vt\"][\"markdown\"] = \"{}\\n\\n#### Sandbox verdicts\\n\".format(samples[key][\"vt\"][\"markdown\"])\n",
    "                for el in samples[key][\"vt\"][\"sandbox_verdicts\"]:\n",
    "                    samples[key][\"vt\"][\"sandboxes\"].append(el)\n",
    "                    samples[key][\"vt\"][\"markdown\"] = \"{}\\n - **{}**\".format(samples[key][\"vt\"][\"markdown\"], el)\n",
    "                    for el_sandbox in samples[key][\"vt\"][\"sandbox_verdicts\"][el]:\n",
    "                        samples[key][\"vt\"][\"markdown\"] = \"{}\\n   - {}: {}\".format(samples[key][\"vt\"][\"markdown\"], el_sandbox, samples[key][\"vt\"][\"sandbox_verdicts\"][el][el_sandbox])\n",
    "        print(\" Finished query VirusTotal directly\")\n",
    "    else:\n",
    "        print(\"No results for \\033[91m{}\\033[90m.\".format(value))\n",
    "    scan_count += 1\n",
    "    if scan_count > 5:\n",
    "        print(\"Sleeping for {} seconds\".format(misp_modules_wait))\n",
    "        time.sleep(misp_modules_wait)\n",
    "        scan_count = 0\n",
    "print(\"Finished {}\".format(module_name))"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "db93d598-3d75-47e3-890a-4d82545a4311",
   "metadata": {},
   "source": [
    "## IN:3 Hashlookup\n",
    "\n",
    "We use [CIRCL Hashlookup](https://www.circl.lu/services/hashlookup/) to identify if any of the samples correspond with **known hashes**."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "9f9082ae-ba18-4532-b500-355c825df1f3",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Loop through the samples\n",
    "\n",
    "module_name = \"hashlookup\"\n",
    "print(\"Lookup the file in {}\".format(module_name))\n",
    "for key, sample in samples.items():\n",
    "    attribute_type = \"sha256\"\n",
    "    value = sample.get(\"sha256\")\n",
    "    module_comment = \"From {} for {}\".format(module_name, key)\n",
    "    data = {\n",
    "        \"attribute\": {\n",
    "            \"type\": f\"{attribute_type}\",\n",
    "            \"uuid\": str(uuid.uuid4()),\n",
    "            \"value\": f\"{value}\",\n",
    "        },\n",
    "        \"module\": module_name,\n",
    "        \"config\": {\"custom_API\": False}\n",
    "    }\n",
    "    print(\"Query \\033[92m{}\\033[90m as \\033[92m{}\\033[90m\".format(value, attribute_type))\n",
    "    result = requests.post(\"{}/query\".format(misp_modules_url), headers=misp_modules_headers, json=data)\n",
    "    if \"results\" in result.json() and len(result.json()[\"results\"]) > 0:\n",
    "        result_json = result.json()[\"results\"]\n",
    "        hashlookup_hit = False\n",
    "        hashlookuptext = \"\"\n",
    "        for misp_attribute in result_json.get(\"Attribute\", []):\n",
    "            misp_attribute[\"comment\"] = \"{}{}\".format(module_comment, misp_attribute.get(\"comment\", \"\"))\n",
    "            created_attribute = misp.add_attribute(misp_event.uuid, misp_attribute, pythonify=True)\n",
    "            if not \"errors\" in created_attribute:\n",
    "                print(\" Got {} \\033[92m{}\\033[90m\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n",
    "                misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_attribute.uuid, \"related-to\"))\n",
    "            else:\n",
    "                print(\" Unable to add {} \\033[92m{}\\033[90m to MISP event\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n",
    "        for misp_object in result_json.get(\"Object\", []):\n",
    "            misp_object[\"comment\"] = \"{}{}\".format(module_comment, misp_object.get(\"comment\", \"\"))\n",
    "            if len(misp_object[\"Attribute\"]) > 0:\n",
    "                created_object = misp.add_object(misp_event.uuid, misp_object, pythonify=True)\n",
    "                if not \"errors\" in created_object:\n",
    "\n",
    "                    print(\" Got \\033[92m{}\\033[90m \".format(misp_object[\"name\"]))\n",
    "                    misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_object.uuid, \"related-to\"))\n",
    "                    for attribute in misp_object.get(\"Attribute\", []):\n",
    "                        if attribute[\"object_relation\"] not in [\"MD5\", \"SHA-1\", \"SHA-256\", \"SSDEEP\"]:\n",
    "                            hashlookuptext = \"{}: {}\\n{}\".format(attribute[\"object_relation\"], attribute[\"value\"], hashlookuptext)\n",
    "                        if attribute.get(\"object_relation\") == \"KnownMalicious\":\n",
    "                            hashlookup_hit = True\n",
    "                else:\n",
    "                    print(\" Unable to add \\033[92m{}\\033[90m to MISP event\".format(misp_object[\"name\"]))\n",
    "\n",
    "        if hashlookup_hit:\n",
    "            misp.tag(sample.get(\"sha256_uuid\"), \"misp-workflow:analysis=\\\"known-file-hash\\\"\")\n",
    "            misp.tag(sample.get(\"malware-sample_uuid\"), \"misp-workflow:analysis=\\\"known-file-hash\\\"\")\n",
    "\n",
    "        sample.get(\"fileobject\").add_attribute(\"text\", hashlookuptext)\n",
    "        samples[key][\"hashlookup\"][\"data\"] = result_json\n",
    "        samples[key][\"hashlookup\"][\"markdown\"] = hashlookuptext\n",
    "        misp.update_object(sample.get(\"fileobject\"))\n",
    "    else:\n",
    "        print(\"No results for \\033[91m{}\\033[90m.\".format(value))\n",
    "print(\"Finished {}\".format(module_name))"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "fc496fae-8ce6-45b5-b9ab-9e561ba99312",
   "metadata": {},
   "source": [
    "## IN:4 MalwareBazaar\n",
    "\n",
    "Now we query [MalwareBazaar](https://bazaar.abuse.ch/). Similar as with VirusTotal, we use the MISP module to get useful attributes and objects. Afterwards we query MalwareBazaar directly to get more details such as vendor intel and tags."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "fa3e236a-94cf-41ff-88b0-6b55aeb18655",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Loop through the samples\n",
    "\n",
    "module_name = \"malwarebazaar\"\n",
    "print(\"Lookup the file in {}\".format(module_name))\n",
    "for key, sample in samples.items():\n",
    "    attribute_type = \"sha256\"\n",
    "    value = sample.get(\"sha256\")\n",
    "    module_comment = \"From {} for {}\".format(module_name, key)\n",
    "    data = {\n",
    "        \"attribute\": {\n",
    "            \"type\": f\"{attribute_type}\",\n",
    "            \"uuid\": str(uuid.uuid4()),\n",
    "            \"value\": f\"{value}\",\n",
    "        },\n",
    "        \"module\": module_name,\n",
    "    }\n",
    "    print(\"Query \\033[92m{}\\033[90m as \\033[92m{}\\033[90m\".format(value, attribute_type))\n",
    "    result = requests.post(\"{}/query\".format(misp_modules_url), headers=misp_modules_headers, json=data)\n",
    "    if \"results\" in result.json() and len(result.json()[\"results\"]) > 0:\n",
    "        result_json = result.json()[\"results\"]\n",
    "        for misp_attribute in result_json.get(\"Attribute\", []):\n",
    "            misp_attribute[\"comment\"] = \"{}{}\".format(module_comment, misp_attribute.get(\"comment\", \"\"))\n",
    "            created_attribute = misp.add_attribute(misp_event.uuid, misp_attribute, pythonify=True)\n",
    "            if not \"errors\" in created_attribute:\n",
    "                print(\" Got {} \\033[92m{}\\033[90m\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n",
    "                misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_attribute.uuid, \"related-to\"))\n",
    "            else:\n",
    "                print(\" Unable to add {} \\033[92m{}\\033[90m to MISP event\".format(misp_attribute[\"type\"], misp_attribute[\"value\"]))\n",
    "        for misp_object in result_json.get(\"Object\", []):\n",
    "            misp_object[\"comment\"] = \"{}{}\".format(module_comment, misp_object.get(\"comment\", \"\"))\n",
    "            if len(misp_object[\"Attribute\"]) > 0:\n",
    "                created_object = misp.add_object(misp_event.uuid, misp_object, pythonify=True)\n",
    "                if not \"errors\" in created_object:\n",
    "                    print(\" Got \\033[92m{}\\033[90m \".format(misp_object[\"name\"]))\n",
    "                    misp.add_object_reference(sample.get(\"fileobject\").add_reference(created_object.uuid, \"related-to\"))\n",
    "                else:\n",
    "                    print(\" Unable to add \\033[92m{}\\033[90m to MISP event\".format(misp_object[\"name\"]))\n",
    "\n",
    "        #################################################################\n",
    "\n",
    "        print(\" Query MalwareBazaar directly to get file details\")\n",
    "        url = \"https://mb-api.abuse.ch/api/v1/\"\n",
    "        result = requests.post(url, data={\"query\": \"get_info\", \"hash\": value})\n",
    "        if \"data\" in result.json() and len(result.json()[\"data\"]) > 0:\n",
    "            result_json = result.json()[\"data\"][0]\n",
    "            samples[key][\"malwarebazaar\"][\"markdown\"] = \"\"\n",
    "            samples[key][\"malwarebazaar\"][\"data\"] = result_json\n",
    "            samples[key][\"malwarebazaar\"][\"tags\"] = result_json.get(\"tags\", \"\")\n",
    "            samples[key][\"malwarebazaar\"][\"signature\"] = result_json.get(\"signature\", \"\")\n",
    "\n",
    "            if len(samples[key][\"malwarebazaar\"][\"tags\"]) > 0:\n",
    "                samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n**Tags**\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"])\n",
    "                for tag in samples[key][\"malwarebazaar\"][\"tags\"]:\n",
    "                    samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n - {}\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"], tag)\n",
    "\n",
    "            if len(result_json.get(\"vendor_intel\", [])) > 0:\n",
    "                if len(result_json.get(\"vendor_intel\", []).get(\"Triage\", [])) > 0:\n",
    "                    samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n**Triage tags and signatures**\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"])\n",
    "                    for tag in result_json.get(\"vendor_intel\", []).get(\"Triage\", []).get(\"tags\", []):\n",
    "                        samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n - {}\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"], tag)\n",
    "                    for signature in result_json.get(\"vendor_intel\", []).get(\"Triage\", []).get(\"signatures\", []):\n",
    "                        samples[key][\"malwarebazaar\"][\"markdown\"] = \"{}\\n - {}\\n\".format(samples[key][\"malwarebazaar\"][\"markdown\"], signature[\"signature\"])\n",
    "        print(\" Finished query MalwareBazaar directly\")\n",
    "    else:\n",
    "        print(\"No results for \\033[91m{}\\033[90m.\".format(value))\n",
    "print(\"Finished {}\".format(module_name))"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "73ca897a-52f2-43a5-8e9b-e26d37a36298",
   "metadata": {},
   "source": [
    "## IN:5 PE imports and exports\n",
    "\n",
    "Next we use `pefile` to analyse the **imports and exports**. The results are added as a **MISP report**."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "15ea69f1-ee32-4243-b03e-83c7fd155fcf",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "summary_pe = \"## PEFile summary\\n\\n\"\n",
    "print(\"Using pefile\")\n",
    "for key, sample in samples.items():\n",
    "    summary_pe += \"## {}\\n\\n\".format(sample[\"path\"])\n",
    "\n",
    "    try:\n",
    "        pe = pefile.PE(sample[\"path\"])\n",
    "        samples[key][\"pe\"][\"imports\"] = \"### Imports\\n\\n\"\n",
    "        samples[key][\"pe\"][\"exports\"] = \"### Exports\\n\\n\"\n",
    "        pe.parse_data_directories()\n",
    "\n",
    "        print(\" Reading imports for {}\".format(sample[\"path\"]))\n",
    "        try:\n",
    "            for entry in pe.DIRECTORY_ENTRY_IMPORT:\n",
    "                #print(\"  {}\".format(entry.dll))\n",
    "                samples[key][\"pe\"][\"imports\"] = \"{}\\n**{}**\\n\".format(samples[key][\"pe\"][\"imports\"], entry.dll)\n",
    "                for imp in entry.imports:\n",
    "                    samples[key][\"pe\"][\"imports\"] = \"{}\\n - {} {}\\n\".format(samples[key][\"pe\"][\"imports\"], hex(imp.address), imp.name)\n",
    "        except:\n",
    "            print(\"  No imports\")\n",
    "            samples[key][\"pe\"][\"imports\"] = \"{}\\n *No imports*\\n\".format(samples[key][\"pe\"][\"imports\"])\n",
    "        summary_pe += samples[key][\"pe\"][\"imports\"]\n",
    "\n",
    "        print(\" Reading exports for {}\".format(sample[\"path\"]))\n",
    "        try:\n",
    "            for exp in pe.DIRECTORY_ENTRY_EXPORT.symbols:\n",
    "                samples[key][\"pe\"][\"exports\"] = \"{}\\n - {} {} {}\\n\".format(samples[key][\"pe\"][\"exports\"], hex(pe.OPTIONAL_HEADER.ImageBase + exp.address), exp.name, exp.ordinal)\n",
    "        except:\n",
    "            print(\"  No exports\")\n",
    "            samples[key][\"pe\"][\"exports\"] = \"{}\\n *No exports*\\n\".format(samples[key][\"pe\"][\"exports\"])\n",
    "        summary_pe += samples[key][\"pe\"][\"imports\"]\n",
    "    except:\n",
    "        print(\" \\033[91mPEFormatError\\033[90m for {}\\n\".format(sample[\"path\"]))\n",
    "        summary_pe += \"\\n*PEFormatError*\\n\"\n",
    "\n",
    "event_title = \"PEFile analysis\"\n",
    "print(\"\\nCreating MISP report \\033[92m{}\\033[90m\".format(event_title))\n",
    "chunk_size = 61500\n",
    "for i in range(0, len(summary_pe), chunk_size):\n",
    "    chunk = summary_pe[i:i + chunk_size]\n",
    "    event_report = MISPEventReport()\n",
    "    event_title_edit = event_title\n",
    "    if i > 0:\n",
    "        event_title_edit = \"{} ({} > {})\".format(event_title, i, i + chunk_size)\n",
    "    event_report.name = event_title_edit\n",
    "    event_report.content = chunk\n",
    "    result = misp.add_event_report(misp_event.id, event_report)\n",
    "    if \"EventReport\" in result:\n",
    "        print(\" Report ID: \\033[92m{}\\033[90m\".format(result.get(\"EventReport\", []).get(\"id\", 0)))\n",
    "    else:\n",
    "        print(\"Failed to create report for \\033[91m{}\\033[90m.\".format(event_title))"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "8ac93da8-42b4-46a7-a02d-609ed92ce715",
   "metadata": {},
   "source": [
    "## IN:6 MISP report for investigation\n",
    "\n",
    "Apart from the report on PE data, we also create a report on the previous investigation steps."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "116fa918-03b6-4b18-ba4b-42814c942bcc",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "summary_iv = \"## Investigation report\\n\\n\"\n",
    "current_date = datetime.now()\n",
    "formatted_date = current_date.strftime(\"%Y-%m-%d\")\n",
    "for key, sample in samples.items():\n",
    "    summary_iv += \"## Analysis for {}\\n\".format(key)\n",
    "    summary_iv += \"- Date: **{}**\\n\".format(formatted_date)\n",
    "    summary_iv += \"- MISP event: **{}** ({})\\n\".format(misp_event.info, misp_event.id)\n",
    "    summary_iv += \"### File details\\n\"\n",
    "    summary_iv += \" - Path: {}\\n\".format(sample[\"path\"])\n",
    "    summary_iv += \" - MD5: {}\\n\".format(sample[\"md5\"])\n",
    "    summary_iv += \" - SHA256: {}\\n\".format(sample[\"sha256\"])\n",
    "    summary_iv += \" - Entropy: {}\\n\".format(sample[\"entropy\"])\n",
    "    summary_iv += \" - Mime-type: {}\\n\".format(sample[\"mimetype\"])\n",
    "    summary_iv += \"### Risk\\n\"\n",
    "    if sample.get(\"detection_ratio\", False):\n",
    "        summary_iv += \" - Detection ratio: **{}**\\n\".format(sample[\"detection_ratio\"])\n",
    "        summary_iv += \" - Detection ratio percentage: **{}**\\n\".format(sample[\"detection_ratio_perc\"])\n",
    "        summary_iv += \" - Detection ratio indirect: {}\\n\".format(sample[\"detection_ratio_indirect\"])\n",
    "        summary_iv += \" - Detection ratio indirect percentage: {}\\n\".format(sample[\"detection_ratio_indirect_perc\"])\n",
    "    else:\n",
    "        summary_iv += \"*No detection ratio found*\\n\\n\"\n",
    "    if sample.get(\"risk_rating\", False):\n",
    "        summary_iv += \" - Risk rating: **{}**\\n\".format(sample[\"risk_rating\"])\n",
    "    summary_iv += \"### VirusTotal\\n\"\n",
    "    if sample[\"vt\"].get(\"markdown\"):\n",
    "        summary_iv += sample[\"vt\"][\"markdown\"]\n",
    "    else:\n",
    "        summary_iv += \"*No results*\\n\"\n",
    "    summary_iv += \"\\n\\n\"\n",
    "    summary_iv += \"### Hashlookup\\n\"\n",
    "    if sample[\"hashlookup\"].get(\"markdown\"):\n",
    "        summary_iv += sample[\"hashlookup\"][\"markdown\"]\n",
    "    else:\n",
    "        summary_iv += \"*No results*\\n\"\n",
    "    summary_iv += \"\\n\\n\"\n",
    "    summary_iv += \"### MalwareBazaar\\n\"\n",
    "    if sample[\"malwarebazaar\"].get(\"markdown\"):\n",
    "        summary_iv += sample[\"malwarebazaar\"][\"markdown\"]\n",
    "    else:\n",
    "        summary_iv += \"*No results*\\n\"\n",
    "    summary_iv += \"\\n\\n\"\n",
    "\n",
    "event_title = \"Malware analysis\"\n",
    "print(\"Creating MISP report \\033[92m{}\\033[90m\".format(event_title))\n",
    "chunk_size = 61500\n",
    "for i in range(0, len(summary_iv), chunk_size):\n",
    "    chunk = summary_iv[i:i + chunk_size]\n",
    "    event_report = MISPEventReport()\n",
    "    event_title_edit = event_title\n",
    "    if i > 0:\n",
    "        event_title_edit = \"{} ({} > {})\".format(event_title, i, i + chunk_size)\n",
    "    event_report.name = event_title_edit\n",
    "    event_report.content = chunk\n",
    "    result = misp.add_event_report(misp_event.id, event_report)\n",
    "    if \"EventReport\" in result:\n",
    "        print(\" Report ID: \\033[92m{}\\033[90m\".format(result.get(\"EventReport\", []).get(\"id\", 0)))\n",
    "    else:\n",
    "        print(\"Failed to create report for \\033[91m{}\\033[90m.\".format(event_title))"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "2882c87b-4fe6-4bed-b0a2-5948b5463fc4",
   "metadata": {},
   "source": [
    "# Correlation\n",
    "\n",
    "## CR:1 Correlation with MISP events\n",
    "\n",
    "This cell searches the **MISP server** for events that have a match with the analysed samples.\n",
    "\n",
    "Only **published** events (`correlation_published`) and attributes that have the **to_ids** flag (`correlation_to_ids`) set are take into account. There is a default limit of **1000 hits** (`correlation_limit`) and you can limit the search with tags (`correlation_match_tags`)."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "d3160c9e-fff3-43d0-b20a-4c691eab6a61",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Only query for published MISP events\n",
    "correlation_published = False\n",
    "\n",
    "# Only consider those values that have the to_ids field set to True\n",
    "correlation_to_ids = True\n",
    "\n",
    "# Limit the returned results to 1000 attributes\n",
    "correlation_limit = 1000\n",
    "\n",
    "# Only return results corresponding with these tags\n",
    "correlation_match_tags = [\"tlp:amber\", \"tlp:white\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "4d9f3f7e-4e8d-45d7-a057-33ed1e805338",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "print(\"Search for correlating MISP events\")\n",
    "# Code block to query MISP and find the correlations\n",
    "for key, sample in samples.items():\n",
    "    value = [sample.get(\"sha256\"), sample.get(\"md5\")]\n",
    "    search_match = misp.search(\"attributes\", to_ids=correlation_to_ids, value=value, tags=correlation_match_tags,\n",
    "                                   published=correlation_published, limit=correlation_limit, pythonify=True)\n",
    "    if len(search_match) > 0:        \n",
    "        for attribute in search_match:\n",
    "            if attribute.Event.id != misp_event.id:   # Skip the event we just created for this playbook\n",
    "                print(\" Found \\033[92m{}\\033[90m in \\033[92m{}\\033[90m ({})\".format(attribute.value, attribute.Event.info, attribute.Event.id ))\n",
    "                entry = {\"source\": \"MISP\", \"category\": attribute.category, \"type\": attribute.type, \"event_id\": attribute.Event.id, \"event_info\": attribute.Event.info}\n",
    "                samples[key][\"MISP\"].append(entry)\n",
    "print(\"Finished searching for correlations\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "e113f340-761b-4fce-b27e-d0dffabf8a69",
   "metadata": {},
   "source": [
    "### MISP events correlation table\n",
    "\n",
    "The correlation results are now stored in `samples`. Execute the next cell to display them in a table format. The table is also included in the summary sent to Mattermost and TheHive."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "28c33600-47c7-4e8d-bc10-6333aded3cf4",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Put the correlations in a pretty table. We can use this table later also for the summary\n",
    "table = PrettyTable()\n",
    "table.field_names = [\"Source\", \"Value\", \"Category\", \"Type\", \"Event\", \"Event ID\"]\n",
    "table.align[\"Value\"] = \"l\"\n",
    "table.align[\"Category\"] = \"l\"\n",
    "table.align[\"Type\"] = \"l\"\n",
    "table.align[\"Event\"] = \"l\"\n",
    "table.align[\"Event ID\"] = \"l\"\n",
    "table._max_width = {\"Event\": 50}\n",
    "for key, sample in samples.items():\n",
    "    for match in sample[\"MISP\"]:\n",
    "        table.add_row([match[\"source\"], key, match[\"category\"], match[\"type\"], match[\"event_info\"], match[\"event_id\"]])\n",
    "print(table.get_string(sortby=\"Value\"))\n",
    "table_mispevents = table"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "4c3266b4-9288-42a5-b610-c50ec4c1ca90",
   "metadata": {},
   "source": [
    "## CR:2 Correlation with MISP feeds\n",
    "\n",
    "This cell searches the **MISP feeds** for events that have a match with the analysed samples. The output of this cell is a table with all the matches. The output is also repeated at the end of the playbook.\n",
    "\n",
    "Note that the correlation lookup in the MISP feeds does not return the name of the MISP event, it returns the UUID of the event as title."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "6a785ced-8210-4097-9408-e56380221a4f",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "print(\"Search in MISP feeds\")\n",
    "misp_cache_url = \"{}/feeds/searchCaches/\".format(misp_url)\n",
    "match = False\n",
    "for key, sample in samples.items():\n",
    "    # Instead of GET, use POST (https://github.com/MISP/MISP/issues/7478)\n",
    "    value = sample.get(\"sha256\")\n",
    "    cache_results = requests.post(misp_cache_url, headers=misp_headers, verify=misp_verifycert, json={\"value\": value})\n",
    "    for result in cache_results.json():\n",
    "        if \"Feed\" in result:\n",
    "            match = True\n",
    "            print(\" Found \\033[92m{}\\033[90m in \\033[92m{}\\033[90m\".format(value, result[\"Feed\"][\"name\"]))\n",
    "            for match in result[\"Feed\"][\"direct_urls\"]:\n",
    "                entry = {\"source\": \"Feeds\", \"feed_name\": result[\"Feed\"][\"name\"], \"match_url\": match[\"url\"]}\n",
    "                samples[key][\"Feeds\"].append(entry)\n",
    "\n",
    "print(\"Finished searching in MISP feeds\")\n",
    "if not match:\n",
    "    print(\"\\033[93mNo correlating information found in MISP feeds.\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "022c3c4f-cf59-469b-8cc6-b9917ff5a6de",
   "metadata": {},
   "source": [
    "### MISP feed correlations table\n",
    "\n",
    "The correlation results are now stored in `samples`. Execute the next cell to display them in a table format. The table is also included in the summary sent to Mattermost and TheHive."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "93de12ec-9896-4ac0-aada-93685b390553",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Put the correlations in a pretty table. We can use this table later also for the summary\n",
    "table = PrettyTable()\n",
    "table.field_names = [\"Source\", \"Value\", \"Feed\", \"URL\"]\n",
    "table.align[\"Value\"] = \"l\"\n",
    "table.align[\"Feed\"] = \"l\"\n",
    "table.align[\"Feed URL\"] = \"l\"\n",
    "table._max_width = {\"Feed\": 50}\n",
    "for key, sample in samples.items():\n",
    "    for match in sample[\"Feeds\"]:\n",
    "        table.add_row([match[\"source\"], key, match[\"feed_name\"], match[\"match_url\"]])\n",
    "print(table.get_string(sortby=\"Value\"))\n",
    "table_mispfeeds = table"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "f02c79cb-e820-44b7-897f-1562726ef938",
   "metadata": {},
   "source": [
    "# Share the sample\n",
    "\n",
    "## SA:1 Store in MWDB\n",
    "\n",
    "The Malware Repository MWDB, formerly known as Malwarecage, is a project from CERT.pl that is available as a service (via [MWDB-CERT.pl](https://mwdb.cert.pl/login) but you can also host its core component [MWDB](https://github.com/CERT-Polska/mwdb-core) on your own infrastructure. In this playbook there's the option to store the sample in your instance of MWDB. Future variants of the playbook include sharing the sample with well-known malware sandboxes.\n",
    "\n",
    "There is a MISP module that allows to upload samples from MISP to MWDB, but in this case the playbook will interact directly with MWDB."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "68cb08ed-76dd-4706-819c-5927eb85a9e9",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "mwdb_public = True\n",
    "mwdb_metakeys = False\n",
    "mwdb = MWDB(api_key=mwdb_apikey, api_url=mwdb_api_url)\n",
    "mwdb_tags = [\"misp\", \"playbook\"]\n",
    "\n",
    "summary_mwdb = \"## Samples stored in MWDB\\n\"\n",
    "print(\"Start sharing with MWDB\")\n",
    "for key, sample in samples.items():\n",
    "    sample_filename = key\n",
    "    with open(sample[\"path\"], 'rb') as file:\n",
    "        data = file.read()\n",
    "    file_object = mwdb.upload_file(sample_filename, data, metakeys=mwdb_metakeys, public=mwdb_public)\n",
    "    for tag in mwdb_tags:\n",
    "        file_object.add_tag(tag)\n",
    "    file_object.add_comment(\"Uploaded from MISP playbook for malware triage, via event {}\".format(misp_event.id))\n",
    "    mwdb_link = \"{}{}\".format(mwdb_api_url.replace(\"/api\", \"/file/\"), file_object.md5)\n",
    "    attribute = MISPAttribute()\n",
    "    attribute.value = mwdb_link\n",
    "    attribute.to_ids = False\n",
    "    attribute.type = \"link\"\n",
    "    attribute.disable_correlation = True\n",
    "    attribute.comment = \"MWDB link for sample {}\".format(key)\n",
    "    attribute_mwdb = misp.add_attribute(misp_event.uuid, attribute, pythonify=True)\n",
    "    summary_mwdb += \" {} in [{}]({})\\n\".format(key, mwdb_link, mwdb_link)\n",
    "    if not \"errors\" in attribute_mwdb:\n",
    "        misp.add_object_reference(sample[\"fileobject\"].add_reference(attribute_mwdb.uuid, \"related-to\"))\n",
    "    else:\n",
    "        print(\" Unable to add attribute with link to MWDB to MISP\")\n",
    "    print(\" Sample {} at \\033[92m{}\\033[90m\".format(key, mwdb_link))\n",
    "summary_mwdb += \"\\n\"\n",
    "print(\"Finished sharing\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "c8bce07e-7109-4ae2-b0c1-c040383b5dee",
   "metadata": {},
   "source": [
    "# Closure\n",
    "\n",
    "In this **closure** or end step we create a **summary** of the actions that were performed by the playbook. The summary is printed in the playbook and can also be send to a chat channel. "
   ]
  },
  {
   "cell_type": "markdown",
   "id": "9ece6cba-58bd-4238-9204-44e3a50cd759",
   "metadata": {},
   "source": [
    "## EN:1 MISP indicators\n",
    "\n",
    "The next section first **queries MISP for the indicators added to the MISP event** that is linked to the execution of this playbook.\n",
    "\n",
    "The indicators are stored in the variable `indicator_table` (table format) and `indicator_raw_list` (in raw format) which is used in a later section to create the playbook summary."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "ed8ec3d2-01da-4e1a-ab5f-511a46e5dae8",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Get all the indicators for our event and store this is in a table. We can also use this for the summary.\n",
    "indicator_search = misp.search(\"attributes\", uuid=misp_event.uuid, to_ids=True, pythonify=True)\n",
    "indicator_raw_list = []\n",
    "indicator_table = PrettyTable()\n",
    "if len(indicator_search) > 0:\n",
    "    indicator_table.field_names = [\"Type\", \"Category\", \"Indicator\", \"Comment\"]\n",
    "    indicator_table.align[\"Type\"] = \"l\"\n",
    "    indicator_table.align[\"Category\"] = \"l\"\n",
    "    indicator_table.align[\"Indicator\"] = \"l\"\n",
    "    indicator_table.align[\"Comment\"] = \"l\"\n",
    "    indicator_table.border = True\n",
    "    for indicator in indicator_search:\n",
    "        if indicator.value not in indicator_raw_list:\n",
    "            comment = indicator.comment\n",
    "            if hasattr(indicator, 'object_relation'):\n",
    "                object_ind = misp.get_object(indicator.object_id, pythonify=True)\n",
    "                comment = \"From '{}' object {} {}\".format(object_ind.name, object_ind.comment.lower(), comment.lower())\n",
    "            indicator_table.add_row([indicator.type, indicator.category, indicator.value, comment])\n",
    "            indicator_raw_list.append(indicator.value)\n",
    "    print(\"Got \\033[92m{}\\033[90m indicator(s) from the event \\033[92m{}\\033[90m ({}).\\n\".format(len(indicator_raw_list), misp_event.info, misp_event.id))\n",
    "else:\n",
    "    print(\"\\033[93mNo indicators found in the event \\033[92m{}\\033[90m ({})\".format(misp_event.info, misp_event.id))"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "85883666-7172-46d5-b6c5-ffe2dc296df4",
   "metadata": {},
   "source": [
    "### Raw list of MISP indicators\n",
    "\n",
    "The indicators are now stored in `indicator_search` (as Python objects) and `indicator_raw_list` (in raw format, only the indicators). Execute the next cell to display them in a table format. The table is also included in the summary sent to Mattermost and TheHive."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "a05e1326-8473-4c25-8b40-4825e727045a",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "if len(indicator_raw_list) > 0:\n",
    "    print(indicator_table.get_string(sortby=\"Type\"))\n",
    "    print(\"\\n\\nIndicator list in raw format:\")\n",
    "    print(\"---------------------------------------------------\")\n",
    "    for el in indicator_raw_list:\n",
    "        print(\"{}\".format(el))\n",
    "    print(\"---------------------------------------------------\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "08ed76cf-6ebf-425b-83a3-bae1acf41771",
   "metadata": {},
   "source": [
    "## EN:2 Create the summary of the playbook\n",
    "\n",
    "The next section creates a summary and stores the output in the variable `summary` in Markdown format. It also stores an intro text in the variable `intro`. These variables are later used when sending information to Mattermost or TheHive."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "3b55bd09-b440-4925-a3af-be62ec7c1c39",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "summary = \"# MISP Playbook summary\\nMalware triage with MISP event: **{}** ({}/events/view/{}). \".format(misp_event.info, misp_url, misp_event.id)\n",
    "\n",
    "summary += \"\\n\"\n",
    "summary += summary_iv\n",
    "summary += \"\\n\"\n",
    "intro = summary\n",
    "\n",
    "summary += \"## Indicators\\n\\n\"\n",
    "summary += \"### Indicators table\\n\\n\"\n",
    "if len(indicator_raw_list) > 0:\n",
    "    indicator_table.set_style(MARKDOWN)\n",
    "    summary += indicator_table.get_string(sortby=\"Type\")\n",
    "    summary += \"\\n\\n\\n\" \n",
    "    summary += \"### Indicators in **raw format**\\n\\n\"\n",
    "    for indicator in indicator_raw_list:\n",
    "        summary += \"{}\\n\\n\".format(indicator)\n",
    "    summary += \"\\n\" \n",
    "else:\n",
    "    summary += \"There are no indicators\"\n",
    "summary += \"\\n\\n\"\n",
    "\n",
    "summary += \"## Correlations\\n\\n\"\n",
    "summary += \"### MISP event matches\\n\\n\"\n",
    "table_mispevents.set_style(MARKDOWN)\n",
    "summary += table_mispevents.get_string()\n",
    "summary += \"\\n\\n\"\n",
    "\n",
    "summary += \"### MISP feed matches\\n\\n\"\n",
    "table_mispfeeds.set_style(MARKDOWN)\n",
    "summary += table_mispfeeds.get_string()\n",
    "summary += \"\\n\\n\"\n",
    "\n",
    "summary += summary_pe\n",
    "\n",
    "summary += \"\\n\\n\"\n",
    "\n",
    "print(\"The \\033[92msummary\\033[90m of the playbook is available.\\n\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "e9aa590d-560a-44f7-bd78-c2e0e9650bba",
   "metadata": {},
   "source": [
    "## EN:3 Print the input for malware triage\n",
    "\n",
    "Apart from the full summary of the investigation, the malware triage summary provides the necessary input for the analyst. This summary was previously also added as a MISP report."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f40135b0-6682-48b8-b5c8-3692a20d5887",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "print(summary_iv)\n",
    "# Or print with parsed markdown\n",
    "# display_markdown(summary_iv, raw=True)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "eba96062-deaa-4fb9-8302-7b3ef829201e",
   "metadata": {},
   "source": [
    "## EN:4 Send a summary to Mattermost\n",
    "\n",
    "Now you can send the summary to Mattermost. You can send the summary in two ways by selecting one of the options for the variable `send_to_mattermost_option` in the next cell.\n",
    "\n",
    "- The default option where the entire summary is in the **chat**, or\n",
    "- a short intro and the summary in a **card**\n",
    "\n",
    "For this playbook we rely on a webhook in Mattermost. You can add a webhook by choosing the gear icon in Mattermost, then choose Integrations and then **Incoming Webhooks**. Set a channel for the webhook and lock the webhook to this channel with *\"Lock to this channel\"*."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "8c455bc3-35cc-436d-8237-95c81b57208f",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "send_to_mattermost_option = \"via a chat message\"\n",
    "#send_to_mattermost_option = \"via a chat message with card\""
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "ca49f33e",
   "metadata": {},
   "outputs": [],
   "source": [
    "message = False\n",
    "if send_to_mattermost_option == \"via a chat message\":\n",
    "    message = {\"username\": mattermost_playbook_user, \"text\": summary}\n",
    "elif send_to_mattermost_option == \"via a chat message with card\":\n",
    "    message = {\"username\": mattermost_playbook_user, \"text\": intro, \"props\": {\"card\": summary}}\n",
    "\n",
    "if message:\n",
    "    r = requests.post(mattermost_hook, data=json.dumps(message))\n",
    "    r.raise_for_status()\n",
    "if message and r.status_code == 200:\n",
    "    print(\"Summary is \\033[92msent to Mattermost.\\n\")\n",
    "else:\n",
    "    print(\"\\033[91mFailed to sent summary\\033[90m to Mattermost.\\n\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "498b8eff-7d1b-47b7-90c4-e13cc738b36c",
   "metadata": {},
   "source": [
    "## EN:5 End of the playbook "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "4c03bb30",
   "metadata": {},
   "outputs": [],
   "source": [
    "print(\"\\033[92m End of the playbook\")\n"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "b17af9f2",
   "metadata": {},
   "source": [
    "## External references <a name=\"extreferences\"></a>\n",
    "\n",
    "- [The MISP Project](https://www.misp-project.org/)\n",
    "- [Mattermost](https://mattermost.com/)\n",
    "- [MWDB](https://github.com/CERT-Polska/mwdb-core)\n",
    "- [MalwareBazaar](https://bazaar.abuse.ch/)\n",
    "- [Hashlookup](https://www.circl.lu/services/hashlookup/)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "e1c813f6",
   "metadata": {},
   "source": [
    "## Technical details \n",
    "\n",
    "### Documentation\n",
    "\n",
    "This playbook requires these Python **libraries** to exist in the environment where the playbook is executed. You can install them with `pip install <library>`.\n",
    "\n",
    "```\n",
    "pyfaup\n",
    "chardet\n",
    "PrettyTable\n",
    "ipywidgets\n",
    "pefile\n",
    "mwdblib\n",
    "```\n",
    "\n",
    "### Colour codes\n",
    "\n",
    "The output from Python displays some text in different colours. These are the colour codes\n",
    "\n",
    "```\n",
    "Red = '\\033[91m'\n",
    "Green = '\\033[92m'\n",
    "Blue = '\\033[94m'\n",
    "Cyan = '\\033[96m'\n",
    "White = '\\033[97m'\n",
    "Yellow = '\\033[93m'\n",
    "Magenta = '\\033[95m'\n",
    "Grey = '\\033[90m'\n",
    "Black = '\\033[90m'\n",
    "Default = '\\033[99m'\n",
    "```"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "14dce812-e432-4c18-8358-f8b66e5b94a5",
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "celltoolbar": "Attachments",
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.9.6"
  },
  "toc-showcode": false,
  "toc-showmarkdowntxt": false
 },
 "nbformat": 4,
 "nbformat_minor": 5
}
"
}
],
"agent": "group--95d64ad6-94d0-4368-b6a2-a83a61eda611",
"targets": [
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1"
],
"on_completion": "end--3901ed01-4f4d-423a-a2fe-cec4b9def7c1"
},
"end--3901ed01-4f4d-423a-a2fe-cec4b9def7c1": {
"type": "end",
"name": "End of Playbook"
}
},
"agent_definitions": {
"group--95d64ad6-94d0-4368-b6a2-a83a61eda611": {
"type": "group",
"name": "MISP playbook users",
"description": "Users responsible for executing MISP playbooks"
}
},
"target_definitions": {
"security-category--2e12720a-0d96-4149-b39e-1206b965f0b1": {
"type": "security-category",
"name": "MISP playbooks",
"category": [
"MISP playbook",
"Jupyter Notebook infrastructure"
]
}
},
"external_references": [
{
"name": "MISP",
"url": "https://github.com/MISP/MISP"
},
{
"name": "MISP Playbook GitHub Repository",
"url": "https://github.com/MISP/misp-playbooks"
},
{
"url": "https://www.misp-project.org/)",
"name": "https://www.misp-project.org/)"
},
{
"url": "https://mattermost.com/)",
"name": "https://mattermost.com/)"
},
{
"url": "https://github.com/CERT-Polska/mwdb-core)",
"name": "https://github.com/CERT-Polska/mwdb-core)"
},
{
"url": "https://bazaar.abuse.ch/)",
"name": "https://bazaar.abuse.ch/)"
},
{
"url": "https://www.circl.lu/services/hashlookup/)",
"name": "https://www.circl.lu/services/hashlookup/)"
}
]
}