I am looking to remove everything before a set of matching HTML lines in all the files in the current folder, using a Perl or sed command.
This is what one of my files in the current folder looks like:
<li>violet</li>
<p>red</p>
<p>blue</p>
<p>gray</p>
<li><a href="#part1">Part 1</a>
</li>
<h2>Part 1</h2>
<p>pink</p>
I am trying to remove everything up to and including these matching HTML lines (which can contain empty lines as you can see):
<li><a href="#part1">Part 1</a>
</li>
So the final edited file would look like this:
<h2>Part 1</h2>
<p>pink</p>
I found that Perl can remove everything before specific characters using:
perl -p -e 's/^.*?abc/abc/'
and sed:
sed 's/[^abc]*\(abc.*\)/\1/'
but I wasn't able to adapt either of these to match the HTML tags.
If anyone has any suggestions, that would be awesome!