pyzor.digest¶
Handle digesting the messages.
-
class
pyzor.digest.
DataDigester
(msg, spec=None)[source]¶ Bases:
object
The major workhouse class.
-
atomic_num_lines
= 4¶
-
digest
¶
-
email_ptrn
= <_sre.SRE_Pattern object>¶
-
longstr_ptrn
= <_sre.SRE_Pattern object>¶
-
min_line_length
= 8¶
-
unwanted_txt_repl
= ''¶
-
url_ptrn
= <_sre.SRE_Pattern object>¶
-
value
¶
-
ws_ptrn
= <_sre.SRE_Pattern object>¶
-
-
class
pyzor.digest.
HTMLStripper
(collector)[source]¶ Bases:
HTMLParser.HTMLParser
Strip all tags from the HTML.
-
class
pyzor.digest.
PrintingDataDigester
(msg, spec=None)[source]¶ Bases:
pyzor.digest.DataDigester
Extends DataDigester: prints out what we’re digesting.