html - Extracting certain information from a page's source code? -
i trying figure out efficient way "scrap" data page's source. example be:
<th class="ddtitle" scope="colgroup" ><a href="/prod/bwckschd.p_disp_detail_sched?term_in=201370&crn_in=71492">printing industry/typogaphy - 71492 - int 101 - 0</a></th> i need text, "printing industry/typography - 71492 - int 101 - 0"
what options? (copy/paste wont because page house on 40 of these items. intend use technique other information.)
have considered using xpath?
Comments
Post a Comment