Skip to content

Commit 9e969f8

Browse files
author
Joshua Mayanja
committed
Added another dataset
1 parent 0ccbecd commit 9e969f8

6 files changed

+850
-72
lines changed

datasets/exports/spam-dataset.csv

+18
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,22 @@
11
label,text,source
2+
spam,Let the financial Robot be your companion in the financial market.,henry-spam
3+
spam,Every your dollar can turn into $100 after you lunch this Robot. https://Vaw.187sued.de/gotodate/promo ,henry-spam
4+
spam,Find out about the easiest way of money earning.,henry-spam
5+
spam,It is the best time to launch the Robot to get more money. https://Vaw.187sued.de/gotodate/promo ,henry-spam
6+
spam,Make dollars just sitting home.,henry-spam
7+
spam,Make your computer to be you earning instrument. https://Vaw.187sued.de/gotodate/promo ,henry-spam
8+
spam,Make thousands of bucks. Financial robot will help you to do it!,henry-spam
9+
spam,No need to worry about the future if your use this financial robot. https://Vaw.187sued.de/gotodate/promo ,henry-spam
10+
spam,Need some more money? Robot will earn them really fast.,henry-spam
11+
spam,Robot is the best way for everyone who looks for financial independence. https://Vaw.187sued.de/gotodate/promo ,henry-spam
12+
spam,Provide your family with the money in age. Launch the Robot!,henry-spam
13+
spam,Join the society of successful people who make money here. https://Vaw.187sued.de/gotodate/promo ,henry-spam
14+
spam,Watch your money grow while you invest with the Robot.,henry-spam
15+
spam,Make $1000 from $1 in a few minutes. Launch the financial robot now. https://Vaw.187sued.de/gotodate/promo ,henry-spam
16+
spam,We know how to make our future rich and do you?,henry-spam
17+
spam,Find out about the easiest way of money earning. https://Vaw.187sued.de/gotodate/promo ,henry-spam
18+
spam,"Make money online, staying at home this cold winter.",henry-spam
19+
spam,We know how to increase your financial stability. https://Vaw.187sued.de/gotodate/promo ,henry-spam
220
ham,"Subject: enron methanol ; meter # : 988291
321
this is a follow up to the note i gave you on monday , 4 / 3 / 00 { preliminary
422
flow data provided by daren } .

datasets/exports/spam-tokenizer.json

+1-1
Large diffs are not rendered by default.

nbs/.ipynb_checkpoints/Convert Datasets to Vectors-checkpoint.ipynb

+14-14
Original file line numberDiff line numberDiff line change
@@ -69,31 +69,31 @@
6969
" </thead>\n",
7070
" <tbody>\n",
7171
" <tr>\n",
72-
" <th>12694</th>\n",
72+
" <th>12712</th>\n",
7373
" <td>ham</td>\n",
7474
" <td>This song means so much to me thank you soooo...</td>\n",
7575
" <td>youtube-spam</td>\n",
7676
" </tr>\n",
7777
" <tr>\n",
78-
" <th>12695</th>\n",
78+
" <th>12713</th>\n",
7979
" <td>ham</td>\n",
8080
" <td>&amp;lt;3</td>\n",
8181
" <td>youtube-spam</td>\n",
8282
" </tr>\n",
8383
" <tr>\n",
84-
" <th>12696</th>\n",
84+
" <th>12714</th>\n",
8585
" <td>spam</td>\n",
8686
" <td>KATY PERRY, I AM THE \"DÉCIO CABELO\", \"DECIO HA...</td>\n",
8787
" <td>youtube-spam</td>\n",
8888
" </tr>\n",
8989
" <tr>\n",
90-
" <th>12697</th>\n",
90+
" <th>12715</th>\n",
9191
" <td>ham</td>\n",
9292
" <td>Honestly speaking except taylor swift and adel...</td>\n",
9393
" <td>youtube-spam</td>\n",
9494
" </tr>\n",
9595
" <tr>\n",
96-
" <th>12698</th>\n",
96+
" <th>12716</th>\n",
9797
" <td>ham</td>\n",
9898
" <td>who is going to reach the billion first : katy...</td>\n",
9999
" <td>youtube-spam</td>\n",
@@ -104,11 +104,11 @@
104104
],
105105
"text/plain": [
106106
" label text source\n",
107-
"12694 ham This song means so much to me thank you soooo... youtube-spam\n",
108-
"12695 ham &lt;3 youtube-spam\n",
109-
"12696 spam KATY PERRY, I AM THE \"DÉCIO CABELO\", \"DECIO HA... youtube-spam\n",
110-
"12697 ham Honestly speaking except taylor swift and adel... youtube-spam\n",
111-
"12698 ham who is going to reach the billion first : katy... youtube-spam"
107+
"12712 ham This song means so much to me thank you soooo... youtube-spam\n",
108+
"12713 ham &lt;3 youtube-spam\n",
109+
"12714 spam KATY PERRY, I AM THE \"DÉCIO CABELO\", \"DECIO HA... youtube-spam\n",
110+
"12715 ham Honestly speaking except taylor swift and adel... youtube-spam\n",
111+
"12716 ham who is going to reach the billion first : katy... youtube-spam"
112112
]
113113
},
114114
"execution_count": 2,
@@ -142,7 +142,7 @@
142142
"data": {
143143
"text/plain": [
144144
"('ham',\n",
145-
" 'Subject: txu noms . for 3 / 14 / 01\\r\\n( see attached file : hplno 314 . xls )\\r\\n- hplno 314 . xls')"
145+
" 'Subject: password reset\\r\\nthis is a generated email - do not reply !\\r\\nif you need further assistance , contact the isc help desk at : 713 - 345 - 4727\\r\\nthe password for your account : po 0507544 has been reset to : 14031399')"
146146
]
147147
},
148148
"execution_count": 4,
@@ -214,8 +214,8 @@
214214
"name": "stderr",
215215
"output_type": "stream",
216216
"text": [
217-
"2022-05-12 11:31:40.540490: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/lib/mesa-diverted/x86_64-linux-gnu:/usr/lib/x86_64-linux-gnu/mesa:/usr/lib/x86_64-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/gallium-pipe\n",
218-
"2022-05-12 11:31:40.540552: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.\n"
217+
"2022-05-15 09:03:05.326533: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/lib/mesa-diverted/x86_64-linux-gnu:/usr/lib/x86_64-linux-gnu/mesa:/usr/lib/x86_64-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/gallium-pipe\n",
218+
"2022-05-15 09:03:05.326601: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.\n"
219219
]
220220
}
221221
],
@@ -355,7 +355,7 @@
355355
{
356356
"data": {
357357
"text/plain": [
358-
"5922675"
358+
"5922945"
359359
]
360360
},
361361
"execution_count": 21,

nbs/.ipynb_checkpoints/SPAM DATASETS-checkpoint.ipynb

+674-2
Large diffs are not rendered by default.

nbs/Convert Datasets to Vectors.ipynb

+14-14
Original file line numberDiff line numberDiff line change
@@ -69,31 +69,31 @@
6969
" </thead>\n",
7070
" <tbody>\n",
7171
" <tr>\n",
72-
" <th>12694</th>\n",
72+
" <th>12712</th>\n",
7373
" <td>ham</td>\n",
7474
" <td>This song means so much to me thank you soooo...</td>\n",
7575
" <td>youtube-spam</td>\n",
7676
" </tr>\n",
7777
" <tr>\n",
78-
" <th>12695</th>\n",
78+
" <th>12713</th>\n",
7979
" <td>ham</td>\n",
8080
" <td>&amp;lt;3</td>\n",
8181
" <td>youtube-spam</td>\n",
8282
" </tr>\n",
8383
" <tr>\n",
84-
" <th>12696</th>\n",
84+
" <th>12714</th>\n",
8585
" <td>spam</td>\n",
8686
" <td>KATY PERRY, I AM THE \"DÉCIO CABELO\", \"DECIO HA...</td>\n",
8787
" <td>youtube-spam</td>\n",
8888
" </tr>\n",
8989
" <tr>\n",
90-
" <th>12697</th>\n",
90+
" <th>12715</th>\n",
9191
" <td>ham</td>\n",
9292
" <td>Honestly speaking except taylor swift and adel...</td>\n",
9393
" <td>youtube-spam</td>\n",
9494
" </tr>\n",
9595
" <tr>\n",
96-
" <th>12698</th>\n",
96+
" <th>12716</th>\n",
9797
" <td>ham</td>\n",
9898
" <td>who is going to reach the billion first : katy...</td>\n",
9999
" <td>youtube-spam</td>\n",
@@ -104,11 +104,11 @@
104104
],
105105
"text/plain": [
106106
" label text source\n",
107-
"12694 ham This song means so much to me thank you soooo... youtube-spam\n",
108-
"12695 ham &lt;3 youtube-spam\n",
109-
"12696 spam KATY PERRY, I AM THE \"DÉCIO CABELO\", \"DECIO HA... youtube-spam\n",
110-
"12697 ham Honestly speaking except taylor swift and adel... youtube-spam\n",
111-
"12698 ham who is going to reach the billion first : katy... youtube-spam"
107+
"12712 ham This song means so much to me thank you soooo... youtube-spam\n",
108+
"12713 ham &lt;3 youtube-spam\n",
109+
"12714 spam KATY PERRY, I AM THE \"DÉCIO CABELO\", \"DECIO HA... youtube-spam\n",
110+
"12715 ham Honestly speaking except taylor swift and adel... youtube-spam\n",
111+
"12716 ham who is going to reach the billion first : katy... youtube-spam"
112112
]
113113
},
114114
"execution_count": 2,
@@ -142,7 +142,7 @@
142142
"data": {
143143
"text/plain": [
144144
"('ham',\n",
145-
" 'Subject: txu noms . for 3 / 14 / 01\\r\\n( see attached file : hplno 314 . xls )\\r\\n- hplno 314 . xls')"
145+
" 'Subject: password reset\\r\\nthis is a generated email - do not reply !\\r\\nif you need further assistance , contact the isc help desk at : 713 - 345 - 4727\\r\\nthe password for your account : po 0507544 has been reset to : 14031399')"
146146
]
147147
},
148148
"execution_count": 4,
@@ -214,8 +214,8 @@
214214
"name": "stderr",
215215
"output_type": "stream",
216216
"text": [
217-
"2022-05-12 11:31:40.540490: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/lib/mesa-diverted/x86_64-linux-gnu:/usr/lib/x86_64-linux-gnu/mesa:/usr/lib/x86_64-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/gallium-pipe\n",
218-
"2022-05-12 11:31:40.540552: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.\n"
217+
"2022-05-15 09:03:05.326533: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/lib/mesa-diverted/x86_64-linux-gnu:/usr/lib/x86_64-linux-gnu/mesa:/usr/lib/x86_64-linux-gnu/dri:/usr/lib/x86_64-linux-gnu/gallium-pipe\n",
218+
"2022-05-15 09:03:05.326601: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.\n"
219219
]
220220
}
221221
],
@@ -355,7 +355,7 @@
355355
{
356356
"data": {
357357
"text/plain": [
358-
"5922675"
358+
"5922945"
359359
]
360360
},
361361
"execution_count": 21,

0 commit comments

Comments
 (0)