Threat Landscape Training Corpus

This directory contains a curated collection of PDFs for training both humans and large language models (LLMs) to discuss cyber threats and threat landscapes effectively. The goal is to build the largest archive of historical threat landscape reports. Most companies publish these documents as part of their marketing efforts, so the links break whenever something changes. This archive keeps those documents available and acts as a curated library for cyber threat landscape cartographers.

Purpose

The primary purpose of this collection is to serve as a comprehensive resource for improving how language models understand and articulate cyber threats and their landscapes.

How to Use

Download the PDFs: Ensure you have access to all the PDFs within this directory.
Integrate with an LLM: Feed these documents into your language model training pipeline.
Training: Use the documents to train your model to improve its proficiency in discussing and analyzing cyber threat landscapes.
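
As a starting point for the steps above, the sketch below builds a simple manifest of the PDFs before they are handed to a training pipeline. It assumes the PDFs sit in four-digit year folders (such as 2021 and 2022) under the directory root; the function name `build_manifest` and that layout are illustrative assumptions, not part of this repository. It only lists files. Extracting text from the PDFs would need a separate PDF library and is not shown.

```python
from collections import defaultdict
from pathlib import Path


def build_manifest(corpus_root):
    """Group PDF paths by their top-level year folder.

    Assumes a layout like <corpus_root>/<year>/.../<report>.pdf.
    This layout is an assumption made for illustration.
    """
    root = Path(corpus_root)
    manifest = defaultdict(list)
    for year_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        # Only treat four-digit directory names as year folders.
        if not (year_dir.name.isdigit() and len(year_dir.name) == 4):
            continue
        # Walk the year folder recursively and keep only PDF files.
        for pdf in sorted(year_dir.rglob("*")):
            if pdf.is_file() and pdf.suffix.lower() == ".pdf":
                manifest[year_dir.name].append(pdf)
    return dict(manifest)


if __name__ == "__main__":
    # Print a per-year count for the current directory.
    for year, pdfs in build_manifest(".").items():
        print(f"{year}: {len(pdfs)} PDF(s)")
```

Run from the root of the directory, this prints one line per year folder that contains at least one PDF. Year folders without PDFs are left out of the manifest.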

Timeline

Current timeline:

2019: Completed
2021: Completed
2022: Completed
2023: Processing
2024: Ongoing

Repository: gertjanbruggink/threat-landscape-training-corpus
