updated Readme.md

2019-11-05 14:50:02 +01:00
parent e1d7c3f760
commit 4f63e62690
1 changed files with 9 additions and 20 deletions
--- a/Readme.md
+++ b/Readme.md
@@ -1,6 +1,6 @@
 # pdfgrab
-* Version 0.4.4
+* Version 0.4.7
 ## What is it?
@@ -9,6 +9,14 @@ Basically it analyses PDF files for Metadata. You can direct it to a file or dir
 You can show it the url of a pdf or use the integrated googlesearch (thanx to mario vilas class)
 to search for pdfs at target site, download and analyse them.
 ## What is new in 0.4.7 release?
 * Added support for html output file, this will be placed in the outdir path and is more clear then a text or json file
 * Added basic logging support, logfile is placed in pdfgrab.py directory
 * Reordered Codebase, exported functionality to some libraries
 * PDF XMP Metadata is grabbed now as well, but not yet saved in output files
 * added docs/ section with Changelog and Todo
 ## What information can be gathered?
 This depends on the software used to create the pdf. And if it has been cleaned. 
@@ -148,25 +156,6 @@ File: pdfgrab/bpf_global_data_and_static_keys.pdf
 /PTEX.Fullbanner This is pdfTeX, Version 3.14159265-2.6-1.40.17 (TeX Live 2016) kpathsea version 6.2.2
 ```
 ## TODO
 * ~~fixed some bugs with *uncommon* pdfs~~
 * add socks proxy
 * ~~add queues~~ for threading
 * ~~add url list to output~~
 * ~~json file-output~~
 * ~~txt file-output~~
 * catch conn refused connections
 * ~~set option for certificate verification, default is true~~
 * ~~complete analyse.txt~~
 * clean up code
 * ~~do more testing~~
 	* do even more testing
 * ~~add random useragent for google and website pdf gathering~~
 * ~~add decryption routine~~
 * ~~catch ssl exceptions~~
 ## Google
 * Search: filetype:pdf site:com