Work the Shell - Simple Scripts to Sophisticated HTML Forms
Let's look at a more interesting subset, instead, by adding a -c flag to have it output just a count of how many films match the specified criteria, you've given the command instead.
To do that, we don't need to go page by page, but just identify and extract the value from the match count on the page. For the comedies with “funny” in the title, the line on the page looks like this: “< Prev | 1 - 20 of 37 | Next 17 >”.
What we need to do is crack the HTML and look at the source to the link to “next 17” and see if it's extractable (is that a word?):
./findmovie.sh -d -g com funny | grep -i "next 17" | head -1 <td align=right><font face=arial size="-2"><nobr> ↪< Prev | <b>1 - 20</b> ↪ of <b>37</b> | <span ↪class="yperlink"><ai href="/mv/search?p=funny&yr=all ↪&gen=com\&syn_match=all&adv=y&type=feature ↪&n=17&b=21&h=s">Next 17</a> > ↪ </nobr></span></span></font></td></tr>
Well that's ugly. You'd think Yahoo didn't want to make this easy or something! It turns out though that this is a pretty tricky task, because if there are no matches, the link doesn't show up, and instead you see “Sorry, no matches were found”. If there are less than 20 matches, you see “Next >”, but it's not a clickable link, so it's not going to be so easy!
Given that I'm out of space, let's defer this topic until next month. Meanwhile, look at the source to various searches yourself and see if anything comes to mind. Otherwise, it'll be brute force!
Dave Taylor has been hacking shell scripts for a really long time, 30 years. He's the author of the popular Wicked Cool Shell Scripts and can be found on Twitter as @DaveTaylor and more generally at www.DaveTaylorOnline.com.
Dave Taylor has been hacking shell scripts for over thirty years. Really. He's the author of the popular "Wicked Cool Shell Scripts" and can be found on Twitter as @DaveTaylor and more generally at www.DaveTaylorOnline.com.
- Two Pi R
- Readers' Choice Awards 2013
- A Handy U-Boot Trick
- The Geek's Guide to the Coolest 2013 Holiday Gifts
- Sublime Text: One Editor to Rule Them All?
- RSS Feeds
- GIMP Shmimp, Give Me a Browser
- Raspberry Pi: the Perfect Home Server
- Tinker with Molecular Dynamics for Fun and Profit
- Linux Systems Administrator
- Máy sấy quần áo
1 hour 25 sec ago
- Services on GlusterFS
1 hour 10 min ago
- máy lọc nước
2 hours 1 min ago
- Máy lọc nước
2 hours 3 min ago
- Reply to comment | Linux Journal
2 hours 50 min ago
- Definitely cool stuff here
3 hours 51 min ago
- thanks for the information
5 hours 2 min ago
- nice information thanks
5 hours 41 min ago
8 hours 37 min ago
- The lost opportunity of security
20 hours 27 min ago