Commit

Updated the links
sehsanm authored Jan 2, 2019
1 parent 54d4be5 commit a67051d
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions data/corpus/README.md
@@ -10,7 +10,7 @@ This link contains the extracted text from FaWiki XML file.

First, we extracted the text data. Next, we normalized it and simply segmented its sentences using regular expressions.

-You can download the corpus using this [LINK](https://sbuacir-my.sharepoint.com/personal/se_mahmoudi_sbu_ac_ir/_layouts/15/download.aspx?SourceUrl=%2Fpersonal%2Fse_mahmoudi_sbu_ac_ir%2FDocuments%2Fsbunlp%2FwikiDump_dotSplitData_Nikvand.zip) here
+You can download the corpus using this [LINK](https://sbuacir-my.sharepoint.com/:f:/g/personal/se_mahmoudi_sbu_ac_ir/EtCWTI-YEoRLiA3G3GtZOPQByjWXef5qPthP-XzIY3xqdA?e=sdbsjY) here

## IrBlogs
irBlogs is a standard Persian weblogs collection that is suitable for studying Persian social networks and evaluation of graph mining and blog retrieval algorithms.
@@ -20,4 +20,4 @@ You can find the collection [here](http://dbrg.ut.ac.ir/irblogs/)
## Persian News Corpus
Persian News Corpus contains more than 120 million sentences from tnews.

-You can download corpus from [here](https://sbuacir-my.sharepoint.com/personal/se_mahmoudi_sbu_ac_ir/Documents/Forms/All.aspx?slrid=5cbcb09e%2D9091%2D7000%2Db143%2D92a4031b9417&RootFolder=%2Fpersonal%2Fse%5Fmahmoudi%5Fsbu%5Fac%5Fir%2FDocuments%2Fsbunlp&FolderCTID=0x01200065B78F960C7F3B4E9E0BBD567D049028)
+You can download corpus from [here](https://sbuacir-my.sharepoint.com/:f:/g/personal/se_mahmoudi_sbu_ac_ir/EtCWTI-YEoRLiA3G3GtZOPQByjWXef5qPthP-XzIY3xqdA?e=sdbsjY)
