Got a RapidMiner crawling/scraping video coming up, but for now, here are some more X-Path ideas to play with:
//*
return all nodes
//*[contains(., 'Search Text')]
return all nodes that contain Search Text in their content. Case sensitive search.
//div[@id='div1']/following-sibling::*
return the next sibling of a specific node (not sure if this works in RapidMiner)
//div[@id='div1']/../
return the parent node of a specific node
in RapidMiner, precede all nodes with "h:", example: //h:div[@class='abc']/h:a