updated Readme.md

2019-11-05 14:50:02 +01:00
parent e1d7c3f760
commit 4f63e62690
1 changed files with 9 additions and 20 deletions
--- a/Readme.md
+++ b/Readme.md
@@ -1,6 +1,6 @@
 # pdfgrab

-* Version 0.4.4
+* Version 0.4.7

 ## What is it?

@@ -9,6 +9,14 @@ Basically it analyses PDF files for Metadata. You can direct it to a file or dir
 You can show it the url of a pdf or use the integrated googlesearch (thanx to mario vilas class)
 to search for pdfs at target site, download and analyse them.

+## What is new in the 0.4.7 release?
+
+* Added support for an HTML output file; it is placed in the outdir path and is clearer than a text or JSON file
+* Added basic logging support; the logfile is placed in the same directory as pdfgrab.py
+* Reorganized the codebase and moved functionality into separate libraries
+* PDF XMP metadata is now extracted as well, but not yet saved to the output files
+* Added a docs/ section with Changelog and Todo
+
 ## What information can be gathered?

 This depends on the software used to create the pdf. And if it has been cleaned. 
@@ -148,25 +156,6 @@ File: pdfgrab/bpf_global_data_and_static_keys.pdf
 /PTEX.Fullbanner This is pdfTeX, Version 3.14159265-2.6-1.40.17 (TeX Live 2016) kpathsea version 6.2.2
 ```

-## TODO
-* ~~fixed some bugs with *uncommon* pdfs~~
-* add socks proxy
-* ~~add queues~~ for threading
-* ~~add url list to output~~
-* ~~json file-output~~
-* ~~txt file-output~~
-* catch conn refused connections
-* ~~set option for certificate verification, default is true~~
-* ~~complete analyse.txt~~
-* clean up code
-* ~~do more testing~~
-	* do even more testing
-* ~~add random useragent for google and website pdf gathering~~
-* ~~add decryption routine~~
-* ~~catch ssl exceptions~~
-
-
-
 ## Google

 * Search: filetype:pdf site:com