This directory contains PDFs to train both humans & models in discussing cyber threats and threat landscapes.

gertjanbruggink/threat-landscape-training-corpus

Threat Landscape Training Corpus

This directory contains a curated collection of PDFs designed to train both humans and language models (LLMs) to discuss cyber threats and threat landscapes effectively. The goal is to build the largest archive of historical threat landscape reports. Most companies publish these documents as part of marketing efforts, so when their websites change, the links break. This archive is meant to keep those documents available, acting as a curated library for cyber threat landscape cartographers.

2024 - cyber threat cartographers

Contents

The corpus includes a variety of documents covering topics such as:

  • Cyber threat landscapes
  • Cyber threats
  • Vulnerabilities
  • Threat intelligence
  • Defense & mitigation strategies

Purpose

The primary purpose of this collection is to serve as a comprehensive resource for enhancing the capabilities of language models in understanding and articulating cyber threats and their landscapes.

How to Use

  • Download the PDFs: Ensure you have access to all the PDFs within this directory.
  • Integrate with an LLM: Feed these documents into your language model training pipeline.
  • Training: Use the documents to train your model to improve its proficiency in discussing and analyzing cyber threat landscapes.
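
As a rough sketch of the first step above, the following Python snippet inventories the PDFs in a local copy of this directory before they are handed to a training pipeline. It is an illustration only, not part of this repository: the function names `build_manifest` and `write_manifest`, the manifest fields, and the JSON Lines output format are all assumptions. It uses only the standard library and does not extract text from the PDFs, which would require a separate PDF parsing library of your choice.

```python
import hashlib
import json
from pathlib import Path


def build_manifest(root):
    """Return one record per PDF found under root (hypothetical helper).

    Each record holds the path relative to root, the file size in bytes,
    and a SHA-256 digest that can be used to spot duplicate or changed files.
    """
    root = Path(root)
    records = []
    # rglob matches "*.pdf" recursively; on case-sensitive file systems
    # this will not pick up files ending in ".PDF".
    for path in sorted(root.rglob("*.pdf")):
        if not path.is_file():
            continue
        # Reads the whole file into memory, which is fine for a sketch
        # but may be worth chunking for very large reports.
        digest = hashlib.sha256(path.read_bytes()).hexdigest()
        records.append({
            "path": path.relative_to(root).as_posix(),
            "bytes": path.stat().st_size,
            "sha256": digest,
        })
    return records


def write_manifest(root, out_file):
    """Write the manifest as JSON Lines and return the number of PDFs listed."""
    records = build_manifest(root)
    with open(out_file, "w", encoding="utf-8") as fh:
        for record in records:
            fh.write(json.dumps(record) + "\n")
    return len(records)
```

Calling `write_manifest("path/to/local/copy", "manifest.jsonl")`, where both paths are placeholders, would list every PDF with its checksum. A manifest like this makes it easier to confirm that all documents were downloaded and to detect duplicates before training.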

Timeline

Current timeline:

  • 2019: Completed
  • 2021: Completed
  • 2022: Completed
  • 2023: Processing
  • 2024: Ongoing
