this is a scraper webservice written in python for google's appengine using BeautifulSoup and soupselect.py.
/scrape/[scrape-requests]?url=[url]&callback=[callback]{ [scrape-request], [scrape-request], ... }[key]: [css selector] [scrape-requests]~attributeTEXTHTMLmultivalue[]singlevalue
/scrape/{items%5B%5D:table.sortable%20tr%20td%20a{title:~title,href:~href}}?url=http://de.wikipedia.org/wiki/Hamburg&callback=jsonpCallback
/scrape/{
links[]: div.brief-post-text a {
title: {TEXT},
href: ~href
}
}?url=http://www.rollingstone.com/rockdaily/index.php/2008/12/08/
remembering-dimebag-darrell-abbott-on-the-anniversary-of-his-death/
/scrape/{
artist: div#view div#content {
title: h1 {TEXT},
bio: div#artist_bio {HTML},
image: div.portrait img{
src: ~src,
width: ~width,
height: ~height
}
},
similars[]: div.rgt div.tpbox li {
title: span.title {TEXT}
}
}?url=http://uk.real.com/music/artist/Madonna/
/scrape/{
links[]: div.tpbox ul li span.title a {
title: {TEXT},
href:~href
}
}?url=http://uk.real.com/music/artist/Madonna/
/scrape/{
bezirke[]: table.sortable tr td a {
title:~title
}
}?url=http://de.wikipedia.org/wiki/Hamburg