He then suggested using a combination of kill, filter, and/or scoring
to delete unwanted files.
I do not think the problem is that simple, Ramsey (p.s., I know regular
expressions extremely well). It was not clear from the original
message that the poster wanted to delete any message, any part of
which contained HTML markup. Some of us suggested ways to delete
or hide only the part of the message with HTML code in it. This
function is not provided by yarn, except for the ability to send
MIME-encoded messages to a program that handles them. The problem
is that various combinations of text and formatted text are not
always sent as MIME-encoded text with appropriate headers. One
has to be a bit creative to automate the recognition of certain
kinds of formatting in certain parts of messages.
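To give a rough idea of what that recognition might look like, here is a
minimal sketch in Python. It is not something yarn provides. The function
name looks_like_html, the tag list, and the min_tags threshold are all
my own assumptions for illustration, and would need tuning against real
mail. It first trusts a proper text/html MIME label if there is one, and
otherwise falls back to counting tag-like strings in the body.

```python
import re
from email import message_from_string

# Hypothetical list of common tags; extend or trim it for your own mail.
# A tag name must be followed by whitespace, "/" or ">", so that an
# address such as <a@b.com> is not mistaken for an <a> tag.
TAG_RE = re.compile(
    r"</?(?:html|head|body|p|br|div|span|font|table|tr|td|a|b|i|img)"
    r"(?:\s[^>]*)?/?>",
    re.IGNORECASE,
)


def looks_like_html(raw_message, min_tags=3):
    """Guess whether a raw message (headers plus body) contains HTML.

    min_tags is an arbitrary threshold, assumed here only to keep a
    stray tag quoted in plain text from flagging the whole message.
    """
    msg = message_from_string(raw_message)

    # Case 1: the sender labelled a part correctly with MIME headers.
    for part in msg.walk():
        if part.get_content_type() == "text/html":
            return True

    # Case 2: no useful MIME label, so count tag-like strings in the
    # body of every non-multipart part.
    for part in msg.walk():
        if part.is_multipart():
            continue
        payload = part.get_payload()
        if isinstance(payload, str) and len(TAG_RE.findall(payload)) >= min_tags:
            return True

    return False
```

A message whose body carries three or more of the listed tags is flagged
even when it has no Content-Type header at all. Plain text that merely
contains "<" and ">" characters is left alone. This is only a heuristic,
so expect both misses and false hits.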
Once we determined that the poster wanted to junk all messages that
had any HTML code, even though parts of such a message might be
plain text, the task was still not all that trivial. Again, not
all HTML text comes as part of a MIME message. For instance, some
mail clients send out all messages as plain text, and then again as
HTML text, with no MIME headers.
Also, a lot of HTML text lacks supposedly standardized tags such as