updated readme

This commit is contained in:
dash
2019-09-26 17:50:10 +02:00
parent 5af03d1ebb
commit fb2dfb1527

View File

@@ -30,6 +30,13 @@ focuses on pdf only.
If you are new to that term have a look here:
https://en.wikipedia.org/wiki/Metadata
## Download
```
git clone https://github.com/c0decave/pdfgrab
cd pdfgrab
python3 pdfgrab.py -h
```
## Usage
@@ -73,6 +80,14 @@ File: pdfgrab/ols2004v2.pdf
--------------------------------------------------------------------------------
```
### Directory Mode
```
./pdfgrab.py -F pdfgrab/
```
Will analyse all pdf's in that directory
### Google Search Mode
```
# ./pdfgrab.py -s site:kernel.org
@@ -107,10 +122,24 @@ File: pdfgrab/bpf_global_data_and_static_keys.pdf
/PTEX.Fullbanner This is pdfTeX, Version 3.14159265-2.6-1.40.17 (TeX Live 2016) kpathsea version 6.2.2
```
## TODO
* json output
* txt output
* catch conn refused connections
* set option for certificate verification, default is false
* complete analyse.txt and seperated
* clean up code
* do more testing
* add random useragent for google and website pdf gathering
* ~~add decryption routine~~
* ~~catch ssl exceptions~~
## Google
Search: filetype:pdf site:com
Results: 264.000.000
* Search: filetype:pdf site:com
* Results: 264.000.000
## Disclaimer