Document Classification Script

This script demo.py classifies a given document into a category, identifies relevant narratives, and determines sub-narratives using a fine-tuned language model.

Usage

python demo.py <document_path> <model_path>

<document_path>: Path to the text file containing the document.
<model_path>: Path to the fine-tuned model directory.

Output Format

The script prints the results in the following format:

Filename: <document_path>
Category: <Category>
Narratives (Format - Category: Main Narrative: Sub-Narrative):
 - <Category>: <Main Narrative>: <Sub-Narrative>
 - ...
-------------------------

For example:

Filename: sample.txt
Category: Ukraine-Russia War
Narratives (Format - Category: Main Narrative: Sub-Narrative):
 - CC: Criticism of climate movement: Climate movement is corrupt
 - CC: Criticism of climate movement: Climate movement is alarmist
 - CC: Questioning the measurements and science: Scientific community is unreliable
-------------------------

Taxonomy details here

Classification Process

Category Classification: Determines if the document belongs to:
- "Ukraine-Russia War" (URW)
- "Climate Change" (CC)
- "Other" (if no relevant category applies)
Main Narrative Identification: Selects the most relevant narrative(s) based on predefined categories.
Sub-Narrative Identification: Further classifies into specific sub-narratives.

Dependencies

Python 3.x
unsloth for model inference
JSON files in Dataset/ for narrative mappings

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Dataset		Dataset
GenerateSyntheticData		GenerateSyntheticData
UnslothTrain		UnslothTrain
README.md		README.md
demo.py		demo.py
test_set_predictions.ipynb		test_set_predictions.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document Classification Script

Usage

Output Format

Classification Process

Dependencies

About

Releases

Packages

Languages

GateNLP/H3Prompt

Folders and files

Latest commit

History

Repository files navigation

Document Classification Script

Usage

Output Format

Classification Process

Dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages