Adding base functionality for source confidence scoring #439

redsand · 2020-11-05T15:43:26Z

Adds base functionality for a new scoring technique. It is implemented as a MISP module; I hope to see it brought into the app itself.

modified: misp_modules/modules/expansion/__init__.py
new file: misp_modules/modules/expansion/source_confidence.py
new file: tools/misp-builddb.py

lgtm-com · 2020-11-05T15:54:15Z

This pull request introduces 24 alerts when merging e1e7d49 into 900fe56 - view on LGTM.com

new alerts:

17 for Unused import
4 for Except block handles 'BaseException'
2 for Unused local variable
1 for Module is imported more than once

lgtm-com · 2020-11-05T16:11:45Z

This pull request introduces 13 alerts when merging 7e77058 into 900fe56 - view on LGTM.com

new alerts:

7 for Unused import
4 for Except block handles 'BaseException'
2 for Unused local variable

lgtm-com · 2020-11-11T00:22:13Z

This pull request introduces 13 alerts when merging 79acdec into ab23547 - view on LGTM.com

new alerts:

7 for Unused import
4 for Except block handles 'BaseException'
2 for Unused local variable

mokaddem · 2020-11-20T13:33:41Z

Hello @redsand!
Thanks a lot for your pull request. Please find my comments below:

I am curious why this script is not relying on MISP's built-in decaying model.
- Are you missing something that is not implemented or not working the way you'd like in the default MISP's implementation?
I see some debugging leftovers (comments and commented-out code), which should be cleaned up.
Querying back MISP for every value is extremely costly and not feasible in a production system:
- misp-modules/misp_modules/modules/expansion/source_confidence.py, line 152 in 79acdec:
  `results = misp.search(value=input_attribute['value'])`
- You could query back MISP from another script, but having this step in the pipeline for every value is too costly.
  - Each query involves the handshake + authentication + full database search + returning the value + processing the output.
As I see it, this module would only handle the confidence part and rely on MISP's built-in decaying implementation. So, only this part:
- misp-modules/misp_modules/modules/expansion/source_confidence.py, line 211 in 79acdec:
  `final_score = ( total_score / confidence) * 100.0 # make it a pct`
- The complete steps would be:
  1. The user issues a restSearch with decaying enabled, filtering out expired data
  2. MISP computes the decaying score
  3. MISP provides the score and additional data to the MISP module
  4. The MISP module returns either a weight or the modified score
  5. MISP filters the results based on the MISP module's feedback and returns the data to the user
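
The steps above can be sketched roughly as follows. This is a minimal illustration, not the module's code: the helper names `build_restsearch_body` and `weight_decay_score` are hypothetical, the request keys `includeDecayScore`, `excludeDecayed` and `decayingModel` and the `decay_score` response field are assumed to match MISP's restSearch decaying options, and treating the source confidence as a weight between 0 and 1 is an assumption made for the example.

```python
from typing import Any, Dict


def build_restsearch_body(decaying_model_id: int) -> Dict[str, Any]:
    """Step 1 (assumed request shape): ask MISP for decay scores and
    drop attributes that the decaying model already considers expired."""
    return {
        "returnFormat": "json",
        "includeDecayScore": 1,
        "excludeDecayed": 1,
        "decayingModel": [decaying_model_id],
    }


def weight_decay_score(attribute: Dict[str, Any], source_confidence: float) -> float:
    """Step 4 (hypothetical): scale the decay score MISP computed by a
    per-source confidence weight in [0, 1] and return the modified score."""
    if not 0.0 <= source_confidence <= 1.0:
        raise ValueError("source_confidence must be between 0 and 1")
    scores = attribute.get("decay_score") or []
    if not scores:
        # No decay score attached: nothing to weight.
        return 0.0
    return float(scores[0]["score"]) * source_confidence
```

With this split, the module never queries MISP itself; it only transforms the score it is handed, which avoids the per-value search cost described above.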

Let us know what you think!

redsand · 2021-05-20T11:03:08Z

This implementation was chosen because the MISP team recommended it during a conference call last year. I am not familiar with the broader MISP project's codebase, hence that suggestion.
I can certainly remove any debugging output, oopsie!
Querying back for all the data for the attribute is required to calculate the score (total_score) properly, since it's a representation of the attribute and its properties for each source provider. I have addressed the cost by precalculating all attributes and updating their scores periodically. More specifically, in our internal implementation, all items are scored and exported as CSVs for real-time processing by our MDR platform.
This is meant to help your team better understand how the paper is written and identify the best way to apply this feature at the production level. I noticed several workflows (as you have) that do not complement the method described in the research paper. For us, we are able to use the source confidence tables along with processing the data on export to calculate all values at that time, and we then simply perform this export every X hours or days.
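
The precalculate-and-export approach described above could look roughly like this. It is only a sketch under stated assumptions: `export_scores` is a hypothetical helper, the scoring function is passed in rather than taken from the module, and the CSV columns (`value`, `type`, `score`) are chosen for illustration, not taken from the actual export.

```python
import csv
from typing import Callable, Dict, Iterable, TextIO


def export_scores(
    attributes: Iterable[Dict[str, str]],
    score_fn: Callable[[Dict[str, str]], float],
    out: TextIO,
) -> int:
    """Score every attribute once and write the results as CSV.

    Returns the number of attributes written. Intended to be run on a
    schedule (every X hours or days) instead of scoring per lookup.
    """
    writer = csv.writer(out)
    writer.writerow(["value", "type", "score"])
    count = 0
    for attribute in attributes:
        score = score_fn(attribute)
        writer.writerow([attribute["value"], attribute["type"], f"{score:.2f}"])
        count += 1
    return count
```

A consumer can then read the exported file directly, so no query against MISP is needed at lookup time.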

redsand added 13 commits September 17, 2020 14:17

v0.1 of new source confidence enrichment module

8eacfb7

modified: misp_modules/modules/expansion/__init__.py
new file: misp_modules/modules/expansion/source_confidence.py
new file: tools/misp-builddb.py

Adding support for detection of Objects and their Attributes as well.

9d56743

fixing errors during qa

5c1b7c0

fixing incorrect object reference

7b5969f

another incorrect object referencde when adding support for Object Attributes

57aeeb5

working on understanding the data formats and how to improve them

85d7e46

fixes bug where exception hits line 152 and os.unlink assumes a file exists.

a66a21b

adding example default out score confidence.

0c96f14

adding retry capabilities, if it fails once my whole build had to start over... so annoying.

46210dc

added more error handling, max execution time of php needs to be much greater than 30 seconds, i chose 900

580c93a

removing nested exceptions

2ae552b

fixes error where var e is overwritten. this is why we use longer variable names, kids.

861edfe

Merge branch 'main' of https://github.com/MISP/misp-modules into main

e1e7d49

misp_modules/modules/expansion/__init__.py

removing un-needed imports

7e77058

Merge branch 'main' of https://github.com/MISP/misp-modules into source_confidence

79acdec
