SA Bugzilla – Bug 8071
get_uri_detail_list & missing links
Last modified: 2023-11-24 07:53:01 UTC
there are links which are not identified when the html code contains tags like that: <!--[if mso]> <!--[if !mso]><!--> <!--[if gte mso 9]> ... <!--<![endif]--> <![endif]-->
my bugfix: before using get_uri_detail_list, I remove these tags: s/(?:<\!--)?<\!\[endif\]-->// for @lines; s/<\!--\[if [\(\)\|\!\w\s]+\]>(?:<\!-->)?// for @lines;
Those are HTML comments, not links or tags, and they have no place in the uri list. If you want to see that content, use a rawbody rule.
I mean that when there is an html code (containing links) inside comment tags, the function does not identify the links.
(In reply to John Hardin from comment #2) > Those are HTML comments, not links or tags, and they have no place in the > uri list. I think he's referring to this: https://email2go.io/blog/outlook-conditional-code Outlook will display anything inside those.
exactly it will really be displayed on some email clients It is probably the same than here: https://bz.apache.org/SpamAssassin/show_bug.cgi?id=8024
I think we should look at this for 4.0.1 and consider if this is a dupe of bug 8024 as well
*** Bug 8024 has been marked as a duplicate of this bug. ***
Agree for 4.0.1, needs some thought and proper testing to not break HTML stuff. Maybe someone can also look if these are good spam indicators for rules.
Created attachment 5878 [details] impacted message Here is a message for you to test. It can't identify the jpg images using get_uri_detail_list function
.jpg linked images on the provided spample are detected for me on trunk.
its valid html5, with 418 warnings and 0 errors :) if spamassassin had htmltidy, this would be pease of cake to get this info as how valid spam or not spam is, imho we should not take the comment fails as spammies, its a microsoft bug that should be fixed
Unless someone wants to argue with this being closed as WORKSFORME and reopen it, it should not have a target milestone, so I'm changing that from 4.0.1 as part of my 4.0.1 release cleanup.