{"id":2200,"date":"2015-09-16T21:19:43","date_gmt":"2015-09-16T12:19:43","guid":{"rendered":"http:\/\/blog.rutake.com\/techmemo\/?p=2200"},"modified":"2015-09-16T21:19:43","modified_gmt":"2015-09-16T12:19:43","slug":"java-html%e3%82%b9%e3%82%af%e3%83%ac%e3%83%bc%e3%83%94%e3%83%b3%e3%82%b0%e3%81%abjsoup","status":"publish","type":"post","link":"https:\/\/blog.rutake.com\/techmemo\/2015\/09\/16\/java-html%e3%82%b9%e3%82%af%e3%83%ac%e3%83%bc%e3%83%94%e3%83%b3%e3%82%b0%e3%81%abjsoup\/","title":{"rendered":"Java HTML\u30b9\u30af\u30ec\u30fc\u30d4\u30f3\u30b0\u306bJSoup"},"content":{"rendered":"<p>\u666e\u6bb5\u306fPHP\u306ePHP Simple HTML DOM Parser\u3092\u5229\u7528\u3057\u3066\u3044\u308b\u304c\u3001<br \/>\n\u30ed\u30b0\u30a4\u30f3\u306e\u5fc5\u8981\u306a\u753b\u9762\u9077\u79fb\u3068\u7d44\u307f\u5408\u308f\u305b\u308b\u306e\u3067Java\u3067\u30b9\u30af\u30ec\u30fc\u30d4\u30f3\u30b0\u3092\u4e45\u3005\u306b\u5b9f\u88c5\u3057\u3066\u307f\u305f\u3002<\/p>\n<p>\u4eca\u56de\u306fJSoup\u3068\u3044\u3046Java\u306e\u30e9\u30a4\u30d6\u30e9\u30ea\u3092\u5229\u7528\u3057\u305f\u3002CSS\u30bb\u30ec\u30af\u30bf\u30fc\u304c\u4f7f\u3048\u308b\u306e\u3067Simple HTML DOM Parser\u3068\u540c\u3058\u3088\u3046\u306a\u8a18\u8ff0\u304c\u3067\u304d\u308b\u3002<\/p>\n<p>\u4e0b\u8a18\u306e\u30b5\u30f3\u30d7\u30eb\u30b3\u30fc\u30c9\u306fSJIS\u306e\u30d5\u30a1\u30a4\u30eb\u3092\u30d1\u30fc\u30b9\u3059\u308b\u3082\u306e\u3002\u76ee\u7684\u306e\u8981\u7d20\u307e\u3067CSS\u30bb\u30ec\u30af\u30bf\u30fc\u3067\u53d6\u5f97\u3057\u305f\u5f8c\u306f\u3001elements\u3092\u30eb\u30fc\u30d7\u3067\u56de\u3059\u3002<\/p>\n<p>[java]<br \/>\nimport java.io.File;<br \/>\nimport java.io.IOException;<\/p>\n<p>import org.jsoup.Jsoup;<br \/>\nimport org.jsoup.nodes.Document;<br \/>\nimport org.jsoup.nodes.Element;<br \/>\nimport org.jsoup.select.Elements;<\/p>\n<p>public class JSoupSample {<\/p>\n<p>\tpublic static void main(String[] args) throws IOException {<br \/>\n\t\t\/\/ TODO Auto-generated method stub<br \/>\n\t\tFile input = new File(&#8220;c:\/html.txt&#8221;);<br \/>\n\t\tDocument doc = Jsoup.parse(input, &#8220;Windows-31J&#8221;, &#8220;http:\/\/example.com\/&#8221;);<\/p>\n<p>\t\tElements dateElements = doc.select(&#8220;div.locator table tbody tr&#8221;);<br \/>\n\t\tfor (Element element : dateElements) {<br \/>\n\t\t\t System.out.println(element.outerHtml());<br \/>\n\t\t}<\/p>\n<p>\t}<\/p>\n<p>}<br \/>\n[\/java]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u666e\u6bb5\u306fPHP\u306ePHP Simple HTML DOM Parser\u3092\u5229\u7528\u3057\u3066\u3044\u308b &hellip; <a href=\"https:\/\/blog.rutake.com\/techmemo\/2015\/09\/16\/java-html%e3%82%b9%e3%82%af%e3%83%ac%e3%83%bc%e3%83%94%e3%83%b3%e3%82%b0%e3%81%abjsoup\/\">\u7d9a\u304d\u3092\u8aad\u3080 <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[10],"tags":[],"_links":{"self":[{"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/posts\/2200"}],"collection":[{"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/comments?post=2200"}],"version-history":[{"count":1,"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/posts\/2200\/revisions"}],"predecessor-version":[{"id":2201,"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/posts\/2200\/revisions\/2201"}],"wp:attachment":[{"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/media?parent=2200"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/categories?post=2200"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.rutake.com\/techmemo\/wp-json\/wp\/v2\/tags?post=2200"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}