Here is the structure of a typical H2G2 page. We start by extracting the `entry_content`, with the aim of parsing it into fedwiki JSON, possibly converting the content to markdown along the way.
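As a rough sketch of the target, the snippet below wraps already-extracted paragraphs in a fedwiki-style page. The function name `paragraphs_to_fedwiki_page` is hypothetical, and the page layout (a `title`, a `story` list of items carrying `type`, `id` and `text`, and a `journal` list) is an assumption that should be checked against a page exported from a real wiki.

```python
import json
import secrets


def paragraphs_to_fedwiki_page(title, paragraphs):
    """Wrap plain-text paragraphs in a fedwiki-style page dict.

    Assumption: a page is a dict with "title", "story" and "journal" keys,
    and each story item has a "type", a random hex "id" and a "text".
    """
    story = [
        {"type": "paragraph", "id": secrets.token_hex(8), "text": text}
        for text in paragraphs
        if text.strip()  # skip empty or whitespace-only paragraphs
    ]
    # The journal (edit history) is left empty in this sketch.
    return {"title": title, "story": story, "journal": []}


page = paragraphs_to_fedwiki_page(
    "Sample Entry", ["First paragraph.", "Second paragraph."]
)
print(json.dumps(page, indent=2))
```

If the content is converted to markdown first, the item `type` would presumably change to match whichever markdown plugin the wiki has installed.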
# CSS Path
For the article's `entry_content`:
html body div#outer_wrapper div#inner_wrapper div#main div#content div#content_left div#entry_wrapper article div#entry_content
# CSS Selector
#entry_content
# XPath
For some reason I can never get the built-in LiveCode XPath processing to work.
//*[@id="entry_content"]
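One way to sanity-check the expression outside LiveCode is Python's standard library, whose `xml.etree.ElementTree` supports a limited XPath subset (hence the leading `.` in the query). This is only a sketch: `SAMPLE_PAGE` is a made-up, well-formed stand-in that mirrors the CSS path above, and `extract_entry_paragraphs` is a hypothetical helper. Real H2G2 pages are unlikely to be valid XML, so they would probably need an HTML-tolerant parser first.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed stand-in for an H2G2 page.
SAMPLE_PAGE = """<html><body>
<div id="outer_wrapper"><div id="inner_wrapper"><div id="main">
<div id="content"><div id="content_left"><div id="entry_wrapper">
<article><div id="entry_content"><p>First paragraph.</p><p>Second paragraph.</p></div></article>
</div></div></div></div></div></div>
</body></html>"""


def extract_entry_paragraphs(page_source):
    """Return the text of each <p> inside the element with id="entry_content"."""
    root = ET.fromstring(page_source)
    # ElementTree equivalent of //*[@id="entry_content"]
    entry = root.find(".//*[@id='entry_content']")
    if entry is None:
        return []
    # itertext() also collects text nested in inline tags such as <b> or <a>.
    return ["".join(p.itertext()).strip() for p in entry.iter("p")]


paragraphs = extract_entry_paragraphs(SAMPLE_PAGE)
print(paragraphs)  # ['First paragraph.', 'Second paragraph.']
```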