I want To delete every thing but a message. For example, if we have the following:
<p class="TweetTextSize js-tweet-text tweet-text" lang="en" data-aria-label-part="0">.<a href="/TuckerCarlson" class="twitter-atreply pretty-link js-nav" dir="ltr" data-mentioned-user-id="22703645" ><s>@</s><b>TuckerCarlson</b></a>: "Massive demographic change has political consequences." <a href="/hashtag/Tucker?src=hash" data-query-source="hashtag_click" class="twitter-hashtag pretty-link js-nav" dir="ltr" ><s>#</s><b>Tucker</b></a><a href="https://t.co/PKqNgaihMQ" class="twitter-timeline-link u-hidden" data-pre-embedded="true" dir="ltr" >pic.twitter.com/PKqNgaihMQ</a></p>
The result after using the command should look like this:
Massive demographic change has political consequences.
My attempt so far
sed -n "/<p class="TweetTextSize js-tweet-text tweet-text" lang="en" data-aria-label-part="0">/,/<\/p>/p">>
What I am trying to do is to delete what is inside all <> </> pattern between <p> </p> and keep the rest.
I know it does not seem easy but I would still appreciate any help.