- A beginner-friendly script to scrape PDF files. You can easily get document info such as creator, creation_date and number of pages, and extract as many pages as you want.
### Installing required libraries
`` pip install PyPDF2 ``
## How to use this script?
- Navigate to the ScapePDF folder in the Command Prompt and type the following command:
`` python pdfscrapper.py ``
- Next, enter the path of the PDF file.
- Ex: C:/Users/Admin/Desktop/sample.pdf
- You will receive information about the PDF, such as who created it and when it was created.
- After that, you will receive the text scraped from the PDF.
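
The steps above can be sketched roughly as follows. This is not the actual contents of `pdfscrapper.py`, only a minimal illustration that assumes PyPDF2 3.x (`PdfReader`, `reader.metadata`, `reader.pages`, `page.extract_text()`). The names `describe_pdf`, `max_pages` and `main` are hypothetical and introduced here for illustration.

```python
def describe_pdf(reader, max_pages=None):
    """Collect document info and page text from a PyPDF2-style reader.

    `reader` is assumed to expose `.metadata` (a dict-like object or None)
    and `.pages` (a sequence of objects with `extract_text()`), as
    PyPDF2's PdfReader does in version 3.x.
    """
    meta = reader.metadata or {}  # metadata can be missing entirely
    total = len(reader.pages)
    limit = total if max_pages is None else min(max_pages, total)

    info = {
        "creator": meta.get("/Creator"),
        "creation_date": meta.get("/CreationDate"),
        "pages": total,
    }
    # extract_text() may yield nothing for scanned or image-only pages
    texts = [reader.pages[i].extract_text() or "" for i in range(limit)]
    return info, texts


def main():
    # Imported here so describe_pdf can be used without PyPDF2 installed
    from PyPDF2 import PdfReader

    path = input("Enter the path of the PDF file: ").strip()
    info, texts = describe_pdf(PdfReader(path))

    print("Creator:", info["creator"])
    print("Created on:", info["creation_date"])
    print("Number of pages:", info["pages"])
    for number, text in enumerate(texts, start=1):
        print(f"--- Page {number} ---")
        print(text)
```

Calling `main()` prompts for a path such as `C:/Users/Admin/Desktop/sample.pdf`, prints the document info, then prints the text of each page. Passing `max_pages` to `describe_pdf` limits how many pages are extracted.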