Page Scraper

by Chris Means

Sometimes a web site has just one piece of information you find interesting or want to keep track of (without having to constantly start your browser and visit the site). You could write your own custom Widget, or, if you're up to the challenge of using Regular Expressions, this simple tool can save you the trouble.

Regular Expressions are NOT for the faint of heart. If you're not an experienced computer programmer, please think twice before trying to use this Widget. Even programme More
Current Version: 1.4.2

Sorted by Newest

Sort by Most helpful
Please sign in if you'd like to review.
  1. Chris Means
    February 21, 2006 · version 1.2 Chris Means
    Technically, there's an unlimited number of Regular Expressions. However, it does have a finite syntax :)

    Here's a number of places to get you started (just remember we're using JavaScript Regular Expressions):
    http://en.wikipedia.org/wiki/Regular_expression
    http://www.regular-expressions.info/

  2. kyanardag
    February 21, 2006 · version 1.2 kyanardag
    very brilliant idea!
    i'm not an expert programmer, so is there any website or any other resource which gives the list of all possible regular expressions?

  3. Chris Means
    February 21, 2006 · version 1.2 Chris Means
    It is possible. When you try to load the page in your browser it's executing JavaScript on the page that forces it to load the original page. The Widget does not execute the JavaScript on the page, it just gets the HTML contents. You just have to build the right regular expression to get the text you're looking for.

  4. burrito22
    February 21, 2006 · version 1.2 burrito22
    oh so it is looking at the html, not just the text? ok, i am new to this but i am picking it up quick. the frame location is http://www.teamxlink.co.uk/statistics.php however when you try to load it by itself, it loads the entire site. there is a script at the beginning that makes this happen. I think this might just not be possible.

  5. Chris Means
    February 21, 2006 · version 1.2 Chris Means
    Change the URL to directly reference the page in the iframe and you'll be getting the right source. You'll still need to work on the regex however...don't forget, there's HTML in there.

Get It!

Avg. Rating:

StarStarStarStarStar (18)

Your Rating:

It's:

Version:

1.4.2

Updated:

2006-04-23

Downloads:

11,888
Windows & Mac

More tagged programming

DX Cluster

Downloads: 8,596
StarStarStarStarStar (6)

Multi-Meter

Downloads: 13,477
StarStarStarStarStar (2)

Bridget

Downloads: 235
StarStarStarStarStar (3)

tinyGO

Downloads: 4,526
StarStarStarStarStar (3)

File Uploader

Downloads: 2,534
StarStarStarStarStar (4)

More by Chris Means

SETI@Home User St…

Downloads: 5,126
StarStarStarStarStar (3)

Page Scraper

Downloads: 11,888
StarStarStarStarStar (18)

Copyright © 2008 Yahoo! Inc. All rights reserved. · Copyright Policy · Terms of Service · Suggestions

NOTICE: We collect personal information on this site. To learn more about how we use your information, see our Privacy Policy.