Closed
573 commits
11194ee
cleaning
puririshi98 Jan 28, 2025
ea15dc9
cleaning
puririshi98 Jan 28, 2025
87d7b70
cleaning
puririshi98 Jan 28, 2025
3d799a4
using similar prompt to vectorRAG team
puririshi98 Jan 29, 2025
131dabc
Merge branch 'master' into latest-txt2kg
puririshi98 Jan 29, 2025
f52f81a
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 29, 2025
9a68f7c
improving
puririshi98 Jan 29, 2025
ddd3828
improving
puririshi98 Jan 29, 2025
95db95b
improving
puririshi98 Jan 30, 2025
49848f9
improving
puririshi98 Jan 30, 2025
8383e96
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 30, 2025
87650fe
improving
puririshi98 Jan 30, 2025
ea3b4c4
Merge branch 'latest-txt2kg' of https://github.com/pyg-team/pytorch_g…
puririshi98 Jan 30, 2025
9ec3ddc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 30, 2025
f95a137
improving
puririshi98 Jan 30, 2025
f91c3e0
Merge branch 'latest-txt2kg' of https://github.com/pyg-team/pytorch_g…
puririshi98 Jan 30, 2025
f377418
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 30, 2025
140c75e
improving
puririshi98 Jan 30, 2025
189ebf1
improving
puririshi98 Jan 30, 2025
1ef9fbc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 30, 2025
f16a0ad
improving
puririshi98 Jan 30, 2025
78319f2
improving
puririshi98 Jan 30, 2025
c71a778
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 30, 2025
30c4766
improving
puririshi98 Jan 30, 2025
40c36fc
improving
puririshi98 Jan 30, 2025
f1e8d30
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 30, 2025
d5325c3
improving
puririshi98 Jan 30, 2025
07bacb9
improving
puririshi98 Jan 30, 2025
2aa99dd
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 30, 2025
1ad6562
improving
puririshi98 Jan 30, 2025
419539e
Merge branch 'latest-txt2kg' of https://github.com/pyg-team/pytorch_g…
puririshi98 Jan 30, 2025
4aed74d
improving
puririshi98 Jan 30, 2025
7d80449
improving
puririshi98 Jan 30, 2025
ca43505
improving
puririshi98 Jan 30, 2025
7d6557d
improving
puririshi98 Jan 30, 2025
4d054b7
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 30, 2025
c93fae5
Hybrid rag (#9995)
puririshi98 Jan 31, 2025
377dfaf
Merge branch 'master' into latest-txt2kg
puririshi98 Jan 31, 2025
b9dc0fc
cleanup
puririshi98 Feb 1, 2025
335f20e
fixing retry script
puririshi98 Feb 3, 2025
fb959cb
fix
puririshi98 Feb 5, 2025
7ed58bd
fix
puririshi98 Feb 5, 2025
8f5e3af
Merge branch 'master' into latest-txt2kg
puririshi98 Feb 5, 2025
b984886
Update hotpot_qa.py
puririshi98 Feb 5, 2025
4925f5c
Update hotpot_qa.py
puririshi98 Feb 5, 2025
7d064dc
save command (#10005)
puririshi98 Feb 5, 2025
ba0acd2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 5, 2025
5146f5a
cleaning
puririshi98 Feb 6, 2025
0749bb3
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 6, 2025
ba79f26
Merge branch 'master' into latest-txt2kg
puririshi98 Feb 11, 2025
5f736ff
Merge branch 'master' into latest-txt2kg
puririshi98 Feb 12, 2025
edd2c04
Update tech_qa.py
puririshi98 Feb 12, 2025
61aeb79
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 12, 2025
4dfe2dc
Update tech_qa.py w preproc instructions
puririshi98 Feb 19, 2025
8438ed6
Merge branch 'master' into latest-txt2kg
puririshi98 Feb 19, 2025
8150e00
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 19, 2025
10493a9
Update tech_qa.py
puririshi98 Feb 19, 2025
a95663c
Update g_retriever.py
puririshi98 Feb 28, 2025
d6e7ada
Merge branch 'master' into latest-txt2kg
puririshi98 Mar 4, 2025
2ff65bb
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 4, 2025
5f9388b
Update txt2kg.py
puririshi98 Mar 4, 2025
70d6cb2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 4, 2025
bb3992d
Update txt2kg.py
puririshi98 Mar 5, 2025
03fc8a5
NIMs can be unreliable, more retries
puririshi98 Mar 6, 2025
e70d88e
more retries for llmjudge
puririshi98 Mar 6, 2025
03be3de
update data set up (#10104)
puririshi98 Mar 8, 2025
959d5f1
Merge branch 'master' into latest-txt2kg
puririshi98 Mar 9, 2025
2ffd715
Update txt2kg.py
puririshi98 Mar 12, 2025
914aeea
fix syntax
puririshi98 Mar 12, 2025
04b4faf
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 12, 2025
068eb1e
Merge branch 'master' into latest-txt2kg
puririshi98 Mar 12, 2025
a128a85
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 12, 2025
cb2e153
Update txt2kg.py
puririshi98 Mar 12, 2025
ca03b7b
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 12, 2025
3da5920
Update llm_judge.py
puririshi98 Mar 12, 2025
12e4ad0
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 12, 2025
db6b66d
Update tech_qa.py
puririshi98 Mar 12, 2025
24c81d0
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 12, 2025
73b3c05
Update tech_qa.py
puririshi98 Mar 12, 2025
3718bbf
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 12, 2025
5eccd9c
Update txt2kg.py
puririshi98 Mar 12, 2025
184de47
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 12, 2025
fde14ef
Update txt2kg.py
puririshi98 Mar 12, 2025
fd8a2e6
Update tech_qa.py
puririshi98 Mar 12, 2025
5b86184
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 12, 2025
982e665
Update tech_qa.py
puririshi98 Mar 12, 2025
04959c2
endpoint for llm judge too
puririshi98 Mar 12, 2025
3b5ff6e
Update sentence_transformer.py
puririshi98 Mar 13, 2025
d2624e3
Update sentence_transformer.py
puririshi98 Mar 13, 2025
07922f4
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 13, 2025
ae31e7b
Update sentence_transformer.py
puririshi98 Mar 13, 2025
dcccbdd
Update llm.py
puririshi98 Mar 13, 2025
429b91c
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Mar 13, 2025
9f50ba5
Update sentence_transformer.py
puririshi98 Mar 13, 2025
d02f173
Update llm.py
puririshi98 Mar 13, 2025
1cf2e95
Update llm.py
puririshi98 Mar 13, 2025
9411a9d
Update tech_qa.py
puririshi98 Mar 13, 2025
b761dbc
Update backend_utils.py
puririshi98 Mar 19, 2025
70d88e2
Merge branch 'master' into latest-txt2kg
puririshi98 Mar 25, 2025
5616d99
Large Graph Indexer WebQSP Refactor (#9806)
zaristei Apr 2, 2025
f3c66aa
Merge branch 'master' into latest-txt2kg
puririshi98 Apr 3, 2025
518cbc5
Adds checkpointing for KG creation to `tech_qa.py` (#10135)
rlratzel Apr 3, 2025
aefdbd8
Update tech_qa.py
puririshi98 Apr 4, 2025
17218ac
Add max_seq_length in SentenceTransformer (#10166)
Kh4L Apr 4, 2025
fb77f15
Updates KG creation checkpointing (#10176)
rlratzel Apr 14, 2025
a9c2736
Updates get_data() in tech_qa.py example to handle JSON corpus data i…
rlratzel Apr 14, 2025
ab3542c
Merge branch 'master' into latest-txt2kg
puririshi98 Apr 14, 2025
ad8fefe
Update README.md
puririshi98 Apr 14, 2025
e1e9a39
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 14, 2025
265b6fa
Add chunking of docs in txt2kg (#10187)
Kh4L Apr 15, 2025
f9f7b86
Bidirectional Sampler Hotfix (#10188)
zaristei Apr 15, 2025
1d2f43b
Update feature_store.py
puririshi98 Apr 15, 2025
0815d35
Update backend_utils.py
puririshi98 Apr 15, 2025
e458037
Update rag_loader.py
puririshi98 Apr 15, 2025
34484f2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 15, 2025
3fd3874
Update web_qsp_dataset.py
puririshi98 Apr 15, 2025
3e34f79
Update backend_utils.py
puririshi98 Apr 15, 2025
cece6de
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 15, 2025
77cf657
Update backend_utils.py
puririshi98 Apr 15, 2025
a08f945
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 15, 2025
464e6be
Update backend_utils.py
puririshi98 Apr 15, 2025
4482156
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 15, 2025
fdde3b7
Update backend_utils.py
puririshi98 Apr 15, 2025
2caec51
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 15, 2025
a00a791
Merge branch 'master' into latest-txt2kg
puririshi98 Apr 15, 2025
7004fa1
Fix missing import in rag feature_store (#10193)
Kh4L Apr 15, 2025
f3da706
Fix missing import in rag #2 (#10194)
Kh4L Apr 15, 2025
f13fbbf
Fix max_seq_len for edge case + import (#10201)
Kh4L Apr 16, 2025
dd46eb2
Add wandb to tech_qa script (#10206)
Kh4L Apr 18, 2025
fb72fba
TXT2KG Add better chunking for the docs (#10214)
Kh4L Apr 22, 2025
1a750df
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Apr 22, 2025
f9c2d12
TXT2G Add manual num_gpus (#10215)
Kh4L Apr 22, 2025
2b6fbcc
TXT2KG: add OOM stats (#10210)
Kh4L Apr 22, 2025
efa144a
Merge branch 'master' into latest-txt2kg
puririshi98 Apr 22, 2025
ebc32d7
Fix TXT2G stats (#10216)
Kh4L Apr 22, 2025
c79d43c
TXT2KG Add regen dataset arg (#10219)
Kh4L Apr 22, 2025
77ff0c4
Update tech_qa.py
puririshi98 Apr 23, 2025
0ea79c8
Update tech_qa.py
puririshi98 Apr 23, 2025
8e6ab0d
Fix tech qa2 (#10224)
Kh4L Apr 23, 2025
ec27ca0
TXT2KG Update RAG tests (#10225)
Kh4L Apr 24, 2025
abae1af
TXT2KG Flag to control doc split (#10228)
Kh4L Apr 24, 2025
1b3e6e0
Merge branch 'master' into latest-txt2kg
puririshi98 Apr 28, 2025
f2b8251
spelling fix
puririshi98 May 9, 2025
ba98e39
v2 txt2kg pr (#10234)
puririshi98 May 29, 2025
8656d0c
fixed merge
puririshi98 Jun 2, 2025
102ade6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jun 2, 2025
7c9b937
Sync latest txt2kg with latest bidirectional sampler branch (#10315)
zaristei Jun 11, 2025
1c3b1e7
GNN RAG Documentation (#10330)
zaristei Jun 18, 2025
9877ec4
Fixes to WebQSP Indexer and GRetriever so that unnecessary disk write…
zaristei Jun 26, 2025
a1a8cbb
Test Coverage for RAG FeatureStore, GraphStore, and Vector RAG Loader…
zaristei Jun 26, 2025
a32d4c5
fix conflicts
puririshi98 Jun 30, 2025
0a06297
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 1, 2025
3757eaf
Delete examples/llm/hotpot_qa.py
puririshi98 Jul 1, 2025
1c5bf56
linting
puririshi98 Jul 1, 2025
f13cc78
ignore mypy errors for vectorRAG, they are trivial
puririshi98 Jul 1, 2025
d8a16d7
Update txt2kg.py
puririshi98 Jul 1, 2025
81e3fc9
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 1, 2025
69d719b
Update txt2kg.py
puririshi98 Jul 1, 2025
b0240ba
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 1, 2025
7514f9f
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 2, 2025
2c1bf40
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 3, 2025
8105dc0
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 3, 2025
c4d2ad2
clean TXT2KG PR (#10344)
puririshi98 Jul 4, 2025
e27566a
Update feature_store.py
puririshi98 Jul 7, 2025
0a5fd6e
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 7, 2025
1f6d25a
Update feature_store.py
puririshi98 Jul 7, 2025
1ad02bc
Update feature_store.py
puririshi98 Jul 7, 2025
9a5e73c
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 7, 2025
bad359c
Update feature_store.py
puririshi98 Jul 7, 2025
d6f8940
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 7, 2025
94e2bc9
Update feature_store.py
puririshi98 Jul 7, 2025
a07cefb
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 7, 2025
2374fe6
Update feature_store.py
puririshi98 Jul 7, 2025
6d576f8
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 7, 2025
11ac01c
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 9, 2025
abaf336
cleaning rag (#10351)
puririshi98 Jul 11, 2025
693bb21
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 14, 2025
75e74bc
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 15, 2025
e279707
try this w/ n_gpus=8 (#10367)
puririshi98 Jul 15, 2025
141a999
Update llm.py
puririshi98 Jul 15, 2025
5015c31
Delete docs/source/advanced/rag.rst, out of date, will replace with a…
puririshi98 Jul 16, 2025
98e9b88
Update index.rst
puririshi98 Jul 16, 2025
9a23d5b
merging
puririshi98 Jul 16, 2025
3cd100a
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 22, 2025
70491ce
Update txt2kg.py
puririshi98 Jul 23, 2025
f5f81b3
Update txt2kg_rag.py
puririshi98 Jul 23, 2025
17c2683
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 23, 2025
edb8827
fix typo
puririshi98 Jul 23, 2025
fa21715
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 23, 2025
90b7275
Update txt2kg_rag.py
puririshi98 Jul 23, 2025
ea5d385
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 23, 2025
f298e2e
Update txt2kg_rag.py
puririshi98 Jul 24, 2025
e5eff99
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 24, 2025
e2959b2
New NIM model testing (for txt2kg and llm judge) (#10375)
puririshi98 Jul 27, 2025
e8ef6e2
update LLM default max_new_tokens to 128
puririshi98 Jul 27, 2025
c505ea3
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 27, 2025
6461456
make max_out_tokens at inference be max_num_chars found at train time
puririshi98 Jul 27, 2025
7d5a5b5
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 27, 2025
fc1e8b7
changing prompting technique for LLM Judge (#10377)
puririshi98 Jul 27, 2025
d6ab055
Update txt2kg_rag.py
puririshi98 Jul 28, 2025
c32f0de
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 28, 2025
68d6e77
Update txt2kg_rag.py
puririshi98 Jul 28, 2025
006206d
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 29, 2025
5bc4d2d
Update g_retriever.py
puririshi98 Jul 29, 2025
89bf2be
Update g_retriever.py
puririshi98 Jul 29, 2025
4d87c0b
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 29, 2025
89fba07
Update g_retriever.py
puririshi98 Jul 29, 2025
36bb7b6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 29, 2025
607bb8d
Merge branch 'master' into latest-txt2kg
puririshi98 Jul 30, 2025
e11c8c6
todo comment for cugraph cuvs
puririshi98 Jul 31, 2025
c524ef2
fix verbosity of loaded document retriever
puririshi98 Jul 31, 2025
7072a3d
Update vectorrag.py
puririshi98 Jul 31, 2025
725985b
Update vectorrag.py
puririshi98 Jul 31, 2025
c3ad86a
fixing CI
puririshi98 Jul 31, 2025
764cd0d
Update vectorrag.py
puririshi98 Jul 31, 2025
1f1db59
better max tokens hueristic
puririshi98 Aug 1, 2025
6126a6a
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Aug 1, 2025
c5f7c9b
Update README.md
puririshi98 Aug 11, 2025
50d1eaa
Merge branch 'master' into latest-txt2kg
puririshi98 Aug 12, 2025
6827379
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Aug 12, 2025
18c7e9a
Update large_graph_indexer.py
puririshi98 Aug 12, 2025
03e08d1
Update large_graph_indexer.py
puririshi98 Aug 13, 2025
1106734
add support for more reaosning models
puririshi98 Aug 26, 2025
f783b13
Merge branch 'master' into latest-txt2kg
puririshi98 Aug 27, 2025
fd41a18
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Aug 27, 2025
3df9a74
fix CI (#10423)
puririshi98 Aug 27, 2025
e9d2244
Update testing_rag.yml
puririshi98 Aug 27, 2025
2730ff0
adding verbose to see why CI is failing but tests pass
puririshi98 Aug 27, 2025
35ff2fe
Update CODEOWNERS
puririshi98 Aug 28, 2025
c682f36
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Aug 28, 2025
b570099
Merge branch 'master' into latest-txt2kg
puririshi98 Aug 28, 2025
2cffd72
Update test_llm.py
puririshi98 Aug 28, 2025
dc6d451
adding gc collect
puririshi98 Aug 28, 2025
1808ed9
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Aug 28, 2025
2a086d6
Update test_llm.py
puririshi98 Aug 28, 2025
8f9e91e
Update test_g_retriever.py
puririshi98 Aug 28, 2025
1990148
remove testing rag since tests pass but CI fails, we have tested the …
puririshi98 Aug 28, 2025
a816af7
Update web_qsp_dataset.py
puririshi98 Aug 29, 2025
5f24ebc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Aug 29, 2025
0a4777a
Update neighbor_sampler.py
puririshi98 Aug 29, 2025
4d04c3a
Delete docs/source/_figures/multihop_example.svg
puririshi98 Aug 29, 2025
f37c972
Delete docs/source/_figures/flowchart.svg
puririshi98 Aug 29, 2025
0ca289e
Delete docs/source/_figures/remote_backend.svg
puririshi98 Aug 29, 2025
9dbeda9
Update neighbor_sampler.py
puririshi98 Aug 29, 2025
256ec7a
Merge branch 'master' into latest-txt2kg
puririshi98 Sep 2, 2025
a26b400
removing use_cwq unused var
puririshi98 Sep 2, 2025
959a7e4
Merge branch 'master' into latest-txt2kg
puririshi98 Sep 2, 2025
f9743e6
Add Annotation to Stored Triples (#10410)
nv-rliu Sep 2, 2025
98e76c6
Merge branch 'master' into latest-txt2kg
puririshi98 Sep 4, 2025
e66bd5e
Merge branch 'master' into latest-txt2kg
puririshi98 Sep 4, 2025
12 changes: 12 additions & 0 deletions .github/CODEOWNERS
@@ -15,3 +15,15 @@
/torch_geometric/sampler/ @rusty1s @mananshah99 @akihironitta

/docs/ @rusty1s @akihironitta

/torch_geometric/loader/rag_loader.py @puririshi98

/torch_geometric/data/large_graph_indexer.py @puririshi98

/torch_geometric/utils/rag @puririshi98

/torch_geometric/nn/nlp @puririshi98

/torch_geometric/nn/models/g_retriever.py @puririshi98

/examples/llm @puririshi98
51 changes: 0 additions & 51 deletions .github/workflows/testing_rag.yml

This file was deleted.

1 change: 1 addition & 0 deletions CHANGELOG.md
@@ -14,6 +14,7 @@ The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/).

### Added

- Added `TXT2KG` class with examples ([#9992](https://github.com/pyg-team/pytorch_geometric/pull/9992))
- Added support for negative weights in `sparse_cross_entropy` ([#10432](https://github.com/pyg-team/pytorch_geometric/pull/10432))
- Added `connected_components()` method to `Data` and `HeteroData` ([#10388](https://github.com/pyg-team/pytorch_geometric/pull/10388))
- Added LPFormer Graph Transformer for Link Prediction ([#9956](https://github.com/pyg-team/pytorch_geometric/pull/9956))
19 changes: 9 additions & 10 deletions examples/llm/README.md
@@ -1,12 +1,11 @@
# Examples for Co-training LLMs and GNNs

| Example | Description |
| -------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| [`g_retriever.py`](./g_retriever.py) | Example for Retrieval-Augmented Generation (RAG) w/ GNN+LLM by co-training `LLAMA3` with `GAT` for answering questions based on knowledge graph information from the toy WebQSP dataset. We also have an [example repo](https://github.com/neo4j-product-examples/neo4j-gnn-llm-example) for integration with [Neo4j Graph DBs](neo4j.com) along with an associated [blog](https://developer.nvidia.com/blog/boosting-qa-accuracy-with-graphrag-using-pyg-and-graph-databases/) showing 2x accuracy gains over LLMs on real medical data. |
| [`g_retriever_utils/`](./g_retriever_utils/) | Contains multiple scripts for benchmarking GRetriever's architecture and evaluating different retrieval methods. |
| [`multihop_rag/`](./multihop_rag/) | Contains starter code and an example run for building a Multi-hop dataset using WikiHop5M and 2WikiMultiHopQA |
| [`nvtx_examples/`](./nvtx_examples/) | Contains examples of how to wrap functions using the NVTX profiler for CUDA runtime analysis. |
| [`molecule_gpt.py`](./molecule_gpt.py) | Example for [MoleculeGPT: Instruction Following Large Language Models for Molecular Property Prediction](https://ai4d3.github.io/2023/papers/34.pdf). Supports MoleculeGPT and InstructMol dataset |
| [`glem.py`](./glem.py) | Example for [GLEM](https://arxiv.org/abs/2210.14709), a GNN+LLM co-training model via variational Expectation-Maximization (EM) framework on node classification tasks to achieve SOTA results |
| [`git_mol.py`](./git_mol.py) | Example for [GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text](https://arxiv.org/abs/2308.06911) |
| [`protein_mpnn.py`](./protein_mpnn.py) | Example for [Robust deep learning--based protein sequence design using ProteinMPNN](https://www.biorxiv.org/content/10.1101/2022.06.03.494563v1) |
| Example | Description |
| -------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| [`g_retriever.py`](./g_retriever.py) | Example for Retrieval-Augmented Generation (RAG) w/ GNN+LLM by co-training `LLAMA3` with `GAT` for answering questions based on knowledge graph information from the toy WebQSP dataset. We also have an [example repo](https://github.com/neo4j-product-examples/neo4j-gnn-llm-example) for integration with [Neo4j Graph DBs](https://neo4j.com) along with an associated [blog](https://developer.nvidia.com/blog/boosting-qa-accuracy-with-graphrag-using-pyg-and-graph-databases/) showing 2x accuracy gains over LLMs on real medical data. |
| [`nvtx_examples/`](./nvtx_examples/) | Contains examples of how to wrap functions using the NVTX profiler for CUDA runtime analysis. |
| [`molecule_gpt.py`](./molecule_gpt.py) | Example for MoleculeGPT: Instruction Following Large Language Models for Molecular Property Prediction. Supports MoleculeGPT and InstructMol dataset |
| [`glem.py`](./glem.py) | Example for [GLEM](https://arxiv.org/abs/2210.14709), a GNN+LLM co-training model via variational Expectation-Maximization (EM) framework on node classification tasks to achieve SOTA results |
| [`git_mol.py`](./git_mol.py) | Example for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text |
| [`protein_mpnn.py`](./protein_mpnn.py) | Example for [Robust deep learning--based protein sequence design using ProteinMPNN](https://www.biorxiv.org/content/10.1101/2022.06.03.494563v1) |
| [`txt2kg_rag.py`](./txt2kg_rag.py) | Full end-to-end RAG pipeline using TXT2KG and vector and graph RAG with a GNN to achieve state-of-the-art results. Uses the [TechQA dataset](https://paperswithcode.com/dataset/techqa) but can be extended to handle any RAG dataset with a corpus of documents and an associated set of Q+A pairs to be split for train/eval/test. See [Kumo.ai x NVIDIA GNN+LLM Webinar](https://www.youtube.com/watch?v=uRIA8e7Y_vs) for more details. Note that each TechQA question can be answered from a single document, so it can be viewed as a toy example. To see significant accuracy boosts from GNN+LLM TXT2KG-based RAG, use data that requires multiple text chunks to answer a question. When a single document is enough to answer a question, basic RAG should be sufficient. |
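
The `txt2kg_rag.py` row above mentions a set of Q+A pairs "to be split for train/eval/test". As a rough illustration of that data shape only, here is a minimal sketch in plain Python. The function name `split_qa_pairs`, the 60/20/20 default fractions, and the seed are all hypothetical and are not taken from the example script, which may split its data differently.

```python
import random
from typing import List, Sequence, Tuple

# A Q+A pair is modeled here as (question, answer); this shape is an assumption.
QAPair = Tuple[str, str]


def split_qa_pairs(
    qa_pairs: Sequence[QAPair],
    train_frac: float = 0.6,  # hypothetical default, not from txt2kg_rag.py
    eval_frac: float = 0.2,   # hypothetical default, not from txt2kg_rag.py
    seed: int = 0,
) -> Tuple[List[QAPair], List[QAPair], List[QAPair]]:
    """Shuffle Q+A pairs deterministically and split into train/eval/test."""
    if train_frac < 0 or eval_frac < 0 or train_frac + eval_frac > 1:
        raise ValueError("fractions must be non-negative and sum to at most 1")

    pairs = list(qa_pairs)
    # A local Random instance keeps the split reproducible without
    # touching the global random state.
    random.Random(seed).shuffle(pairs)

    n_train = int(len(pairs) * train_frac)
    n_eval = int(len(pairs) * eval_frac)

    train = pairs[:n_train]
    evaluation = pairs[n_train:n_train + n_eval]
    test = pairs[n_train + n_eval:]  # whatever remains goes to test
    return train, evaluation, test
```

Every pair lands in exactly one split, and a fixed seed returns the same partition on each call, which keeps evaluation numbers comparable across runs.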