Skip to content

DeerLu1220/nlp2024_homeworks

Repository files navigation

hw1a

Get the cookies for the drive page otherwise the script doesn't work (see official gdwon page) You also need the evaluation .csv in the root folder.

hw1b_nlp2024

After having put in the root folder of the repo the .csv file containing the list of students and their assigned datasets, run:

python rebalance_datasets_hw1b.py

The .csv schema should be as follow:

['University ID',   # mandatory in position 1
 'In need for Italian speakers or for more project members (Y/N)',
 'Group ID (make sure the ID you specify is NOT already present in previous rows, unless the student belongs to the same group!)',
 'Assigned datasets', # mandatory in position 3
 'New assigned dataset (for students previously having only one)',
 'Assigned distractor dataset (EXTRA)'
]

Example output:

> Distribution of HW1B datasets BEFORE (mean 9.333):
0  |#######
1  |##########
5  |######
7  |#######
8  |#######
9  |###############################
18 |########
21 |#######
22 |######
24 |########
27 |########
28 |#######

> Distribution of HW1B datasets AFTER (mean 8.083):
0  |######
1  |######
5  |#####
7  |#####
8  |######
9  |##############################
18 |######
21 |#####
22 |######
24 |########
27 |#######
28 |#######

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published