Skip to content

Commit 27a2388

Browse files
committed
Updated dataset link in README.md
1 parent 979245a commit 27a2388

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

README.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ cmake .. -DTF_BACKEND=1 && make -j$(nproc) && make install_python_lib
4545
cd ../examples/criteo # or your own path where .py launchers reside
4646
```
4747

48-
You will need to spawn a scheduler, at least one parameter server (PS), and at least one worker for training to begin. We provide example launching commands below. These commands launch the necessary ECRec instances on a single host. Note that our experiments need the Criteo Terabyte dataset downloaded to a local path on the worker machine. The following command downloads the pre-processed dataset from our S3 bucket.
48+
You will need to spawn a scheduler, at least one parameter server (PS), and at least one worker for training to begin. We provide example launching commands below. These commands launch the necessary ECRec instances on a single host. Note that our experiments need the Criteo Terabyte dataset downloaded to a local path on the worker machine. The following command downloads a part of the pre-processed dataset from our server. All processed parts of the Criteo Terabyte dataset can be found [here](https://ftp.pdl.cmu.edu/pub/datasets/DLRM/criteo-terabytes/).
4949

5050
```sh
5151
# scheduler
@@ -60,7 +60,7 @@ apt-get update && apt-get install -y zookeeper \
6060
python criteo_training.py --task_name=ps --zk_addr=zfs://0.0.0.0:2181/scheduler --task_index=0
6161

6262
# worker
63-
mkdir /xdl_training_samples && wget https://criteo-terabytes.s3-us-west-2.amazonaws.com/day_0_processed_tiny_0 -O /xdl_training_samples/data.txt
63+
mkdir /xdl_training_samples && wget https://ftp.pdl.cmu.edu/pub/datasets/DLRM/criteo-terabytes/day_0_processed_tiny_0 -O /xdl_training_samples/data.txt
6464

6565
python criteo_training.py --task_name=worker --zk_addr=zfs://0.0.0.0:2181/scheduler --task_index=0 --task_num=1
6666
```

0 commit comments

Comments
 (0)