WIP: Formatting #148

i-be-snek · 2024-09-30T13:44:35Z

This PR is meant to do two things:

(1) format any left-over files using the pre-commit hook (done automatically)
(2) improve the README, especially after a large number of changes were made to the pipeline

i-be-snek · 2024-09-30T13:46:05Z

README.md

 Before you run our pipeline, please choose a version of prompts to proceed, which can be revised in the beginning of **run_prompts.py**

 ```shell
 from Database.Prompts.prompts import V_3 as target_prompts
 ```

 #### (Step 1) Raw output
-Choose the raw file contains the text you need to process, please use the clear raw file name to indicate your experiment, this name will be used as the output file, the api env you want to use, the decription of the experiment, the prompt category, and the batch file location you want to store the batch file (this is not mandatory, but it's good to check if you create correct batch file)
+Choose the raw file that contains the text you need to process. Please use clear raw file names to indicate your experiment. This name will be used as the output file, the api env you want to use, the decription of the experiment, the prompt category, and the batch file location you want to store the batch file (this is not mandatory, but it's good to check if you create correct batch file)


@liniiiiii

I don't understand this sentence:

This name will be used as the output file, the api env you want to use, the decription of the experiment, the prompt category, and the batch file location you want to store the batch file (this is not mandatory, but it's good to check if you create correct batch file)

Is it suggesting that the experiment name and description and category will all be the name of the output file?

Maybe adding a pseudo example (or a real example) could help.

Thanks, I will do that. Where can I edit it? In the same branch?

You can edit the same branch. I think for READMEs you can even safely edit directly on the GitHub website :D
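
A pseudo example of the kind requested above might look like the sketch below. Only the flag names are taken from the `--help` output of `run_prompts.py` quoted later in this thread; every value (file name, directories, env file, description) is a hypothetical placeholder, and how the script derives the output file name is not shown here.

```python
import shlex

# Hypothetical placeholder values; only the flag names come from the
# `--help` output of run_prompts.py quoted in this thread.
settings = {
    "--filename": "example_articles.json",
    "--raw_dir": "path/to/raw",
    "--batch_dir": "path/to/batch",
    "--api_env": ".env",
    "--description": "example_experiment",
    "--prompt_category": "all",
}

# Assemble the command as a list so each argument stays a separate token.
command = ["poetry", "run", "python3", "Database/Prompts/run_prompts.py"]
for flag, value in settings.items():
    command += [flag, value]

# Print a shell-safe, copy-pasteable version of the command.
print(shlex.join(command))
```

Keeping the settings in one dictionary makes it easy to see at a glance which value belongs to which flag.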

liniiiiii · 2024-11-27T19:49:39Z

Please keep this PR for a while. I will check the other READMEs I edited later, thanks!

i-be-snek · 2024-11-27T20:15:26Z

README.md

 Before you run our pipeline, please choose a version of prompts to proceed, which can be revised in the beginning of **run_prompts.py**

 ```shell
 from Database.Prompts.prompts import V_3 as target_prompts
 ```
+##### Step 1: Experiment Settings


Wow, this looks great! Thanks :D

One thing that could help the reader is to say that these are the params to pass into `run_prompts.py`.

Suggested change

-##### Step 1: Experiment Settings
+##### Step 1: Experiment Settings
+
+Here is what you need to begin an experiment run with `Database/Prompts/run_prompts.py`:

i-be-snek · 2024-11-27T20:16:25Z

README.md

+4. **Prompt Category**: Indicate the prompt category, such as "all".
+
+5. **Batch File Location** (Optional): Specify where to store the batch file. This helps verify the batch file's creation.
+


Could we add something like this:

    # check the args and flags
    poetry run python3 Database/Prompts/run_prompts.py --help

Output:

    wikimpacts-py3.11➜ Wikimpacts git:(drop-l1-missing-all-impacts) ✗ poetry run python3 Database/Prompts/run_prompts.py --help
    usage: run_prompts.py [-h] [-f FILENAME] [-r RAW_DIR] [-b BATCH_DIR] [-m MODEL_NAME] [-t MAX_TOKENS] [-e API_ENV] [-d DESCRIPTION] [-p PROMPT_CATEGORY]

    options:
      -h, --help            show this help message and exit
      -f FILENAME, --filename FILENAME
                            The name of the json file in the <Wikipedia articles> directory
      -r RAW_DIR, --raw_dir RAW_DIR
                            The directory containing Wikipedia json files to be run
      -b BATCH_DIR, --batch_dir BATCH_DIR
                            The directory where the batch file will land (as .jsonl)
      -m MODEL_NAME, --model_name MODEL_NAME
                            The model version applied in the experiment, like gpt-4o-mini.
      -t MAX_TOKENS, --max_tokens MAX_TOKENS
                            The max tokens of the model selected
      -e API_ENV, --api_env API_ENV
                            The env file that contains the API keys.
      -d DESCRIPTION, --description DESCRIPTION
                            The description of the experiment
      -p PROMPT_CATEGORY, --prompt_category PROMPT_CATEGORY
                            The prompt category of the experiment, can only choose from impact, basic, and all
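
For readers who want to see how such a help text comes about, here is a minimal `argparse` sketch that mirrors the flags and help strings shown above. It is not the actual code of `run_prompts.py`: the argument types, the absence of defaults, and the use of `choices` for the prompt category are assumptions made for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the help text above (types and defaults are assumptions)."""
    parser = argparse.ArgumentParser(prog="run_prompts.py")
    parser.add_argument("-f", "--filename", type=str,
                        help="The name of the json file in the <Wikipedia articles> directory")
    parser.add_argument("-r", "--raw_dir", type=str,
                        help="The directory containing Wikipedia json files to be run")
    parser.add_argument("-b", "--batch_dir", type=str,
                        help="The directory where the batch file will land (as .jsonl)")
    parser.add_argument("-m", "--model_name", type=str,
                        help="The model version applied in the experiment, like gpt-4o-mini.")
    # Assumption: the token limit is parsed as an integer.
    parser.add_argument("-t", "--max_tokens", type=int,
                        help="The max tokens of the model selected")
    parser.add_argument("-e", "--api_env", type=str,
                        help="The env file that contains the API keys.")
    parser.add_argument("-d", "--description", type=str,
                        help="The description of the experiment")
    # Assumption: the three allowed categories are enforced with `choices`;
    # `metavar` keeps the usage line as PROMPT_CATEGORY.
    parser.add_argument("-p", "--prompt_category", type=str,
                        choices=["impact", "basic", "all"], metavar="PROMPT_CATEGORY",
                        help="The prompt category of the experiment, "
                             "can only choose from impact, basic, and all")
    return parser


# Parse a hypothetical argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["-f", "example.json", "-p", "all", "-t", "1000"])
print(args.filename, args.prompt_category, args.max_tokens)
```

Passing an explicit list to `parse_args` makes the sketch easy to try out without touching the command line; flags that are omitted, such as `--batch_dir`, come back as `None`.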

Thanks for the suggestion, I will look into it after I have fixed the visualization!

i-be-snek · 2024-12-04T19:28:53Z

README.md

@@ -28,42 +30,137 @@ pre-commit installed at .git/hooks/pre-commit
 git lfs install
 ```

-## Quickstart
+## Development


As per the suggestion from @koffiworou, I've moved the dev doc section further to the top so that users can make sure they have all the basics and dependencies set up before developing.

i-be-snek commented Sep 30, 2024

i-be-snek force-pushed the main branch from deff00f to 4e36c73 on October 1, 2024 11:45

i-be-snek added the documentation label (Improvements or additions to documentation) Oct 2, 2024

i-be-snek mentioned this pull request Oct 3, 2024

upload the shape files for visualization #161

Merged

i-be-snek and others added 4 commits October 13, 2024 08:54

🚨 Fix linter warnings + format .py and .json

a2b13c7

📝 Improve readme

33791fc

Especially needed after undergoing a large number of changes

Add CSV file with Wikipedia URLs for database (#150)

c824d02

📝 Small edits to docs

f87c207

i-be-snek force-pushed the formatting branch from 0eb1056 to f87c207 on October 13, 2024 06:58

i-be-snek and others added 3 commits November 18, 2024 10:35

Merge branch 'main' into formatting

3c23201

Merge branch 'main' into formatting

ff05484

📝 Improve language in docs

d0bfeb1

i-be-snek commented Nov 26, 2024

i-be-snek and others added 4 commits November 26, 2024 14:11

🚨 Fix lint warnings

791b9a2

📝 Fix typos

d8f3a50

update the readme for the OpenAI models application part

fd7aeb9

update the readme for the OpenAI models application part with correct order

32f26d1

i-be-snek commented Nov 27, 2024

📝 Move dev instructions to the top

b0698e4

i-be-snek commented Dec 4, 2024

i-be-snek added 2 commits December 18, 2024 15:56

Merge branch 'main' into formatting

8d49cf8

Merge branch 'main' into formatting

0eb54d1
