tr_tagcontentgrep -- grep for a xml/sgml/html tag |
tr_tagcontentgrep -- grep for a xml/sgml/html tag
tr_tagcontentgrep [-hl] regexp-pattern [file ...]
tr_tagcontentgrep opens all files provided on the command line and searches for the given pattern in the tags. The search is not case sensitive.
tr_tagcontentgrep is part of the HTML::TagReader package but is an example that you can also do 'reading by tag' without HTML::TagReader. tr_tagcontentgrep uses plain perl and sets the $/ variable. Working without HTML::TagReader causes however problems when working with faulty html code where single '<'-characters appear somewhere in the text.
All space in the tags is reduced to max. one space. You can search for ``a href'' even if the original tag had multiple spaces between ``a'' and ``href''.
-h this help
-l list filename and line number
tr_tagcontentgrep -l img file.html
would e.g print something like:
index.html:53: <IMG src=``../images/transpix.gif'' alt=``''> index.html:257: <IMG SRC=``../Logo.gif'' width=128 height=53>
tr_tagcontentgrep is part of the HTML::TagReader package and was written by Guido Socher [guido(at)linuxfocus.org]
tr_tagcontentgrep -- grep for a xml/sgml/html tag |