<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Parser on Micha Kops&#39; Tech Notes</title>
    <link>https://www.hascode.com/tags/parser/</link>
    <description>Recent content in Parser on Micha Kops&#39; Tech Notes</description>
    <generator>Hugo</generator>
    <language>en</language>
    <copyright>Copyright © 2010 - 2025 Micha Kops. #e9d956c0c0154a221ad83c925346a8fa0e72f866</copyright>
    <lastBuildDate>Tue, 22 Jul 2014 00:00:00 +0200</lastBuildDate>
    <atom:link href="https://www.hascode.com/tags/parser/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>XMLBeam: Snippets and Examples</title>
      <link>https://www.hascode.com/xmlbeam-snippets-and-examples/</link>
      <pubDate>Tue, 22 Jul 2014 00:00:00 +0200</pubDate>
      <guid>https://www.hascode.com/xmlbeam-snippets-and-examples/</guid>
      <description>&lt;div id=&#34;preamble&#34;&gt;
&lt;div class=&#34;sectionbody&#34;&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;XMLBeam is an interesting library that projects parts of an XML DOM tree into Java using simple interfaces, annotations and XPath expressions.&lt;/p&gt;
&lt;/div&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;In the following article, I’d like to share three of my experiments with this library: reading XML, writing XML and parsing a live RSS feed.&lt;/p&gt;
&lt;/div&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;div class=&#34;title&#34;&gt;RSS Feed Projection Interface&lt;/div&gt;
&lt;p&gt;&lt;span class=&#34;image&#34;&gt;&lt;img src=&#34;rss-projection-interface.png&#34; alt=&#34;rss projection interface&#34;/&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;div class=&#34;sect1&#34;&gt;
&lt;h2 id=&#34;_dependencies&#34;&gt;Dependencies&lt;/h2&gt;
&lt;div class=&#34;sectionbody&#34;&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;Using Maven, we need to add only one dependency to our &lt;em&gt;pom.xml&lt;/em&gt;:&lt;/p&gt;
&lt;/div&gt;</description>
    </item>
    <item>
      <title>Creating Grammar Parsers in Java and Scala with Parboiled</title>
      <link>https://www.hascode.com/creating-grammar-parsers-in-java-and-scala-with-parboiled/</link>
      <pubDate>Sun, 26 Jan 2014 00:00:00 +0100</pubDate>
      <guid>https://www.hascode.com/creating-grammar-parsers-in-java-and-scala-with-parboiled/</guid>
      <description>&lt;div id=&#34;preamble&#34;&gt;
&lt;div class=&#34;sectionbody&#34;&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;Parboiled is a modern, lightweight and easy-to-use library to parse expression grammars in Java or Scala, and in my humble opinion it is perfect for use cases where you need something between regular expressions and a complex parser generator like ANTLR.&lt;/p&gt;
&lt;/div&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;In the following tutorial we’re going to create a simple grammar to specify a task list and write a parser implementation as well as unit tests for each grammar rule in Java.&lt;/p&gt;
&lt;/div&gt;</description>
    </item>
    <item>
      <title>Content Detection, Metadata and Content Extraction with Apache Tika</title>
      <link>https://www.hascode.com/content-detection-metadata-and-content-extraction-with-apache-tika/</link>
      <pubDate>Sun, 02 Dec 2012 00:00:00 +0100</pubDate>
      <guid>https://www.hascode.com/content-detection-metadata-and-content-extraction-with-apache-tika/</guid>
      <description>&lt;div id=&#34;preamble&#34;&gt;
&lt;div class=&#34;sectionbody&#34;&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;If you want to extract metadata or content from a file (be it an office document, a spreadsheet, an MP3 or an image), or you’d like to detect the content type of a given file, Apache Tika might be a helpful tool for you.&lt;/p&gt;
&lt;/div&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;Apache Tika supports a variety of document formats and has a nice, extendable parser and detection API with a lot of built-in parsers available.&lt;/p&gt;
&lt;/div&gt;</description>
    </item>
    <item>
      <title>Screenscraping made easy using jsoup and Maven</title>
      <link>https://www.hascode.com/screenscraping-made-easy-using-jsoup-and-maven/</link>
      <pubDate>Tue, 30 Aug 2011 00:00:00 +0200</pubDate>
      <guid>https://www.hascode.com/screenscraping-made-easy-using-jsoup-and-maven/</guid>
      <description>&lt;div id=&#34;preamble&#34;&gt;
&lt;div class=&#34;sectionbody&#34;&gt;
&lt;div class=&#34;imageblock&#34;&gt;
&lt;div class=&#34;content&#34;&gt;
&lt;img src=&#34;logo2.png&#34; alt=&#34;logo2&#34;/&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;Sometimes in a developer’s life there is no clean API available to gather information from a web application: no SOAP, no XML-RPC and no REST, just a website hiding the information we’re looking for somewhere in its DOM hierarchy – so the only solution is screenscraping.&lt;/p&gt;
&lt;/div&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;Screenscraping always leaves me with a bad feeling – but luckily there is a tool that makes this job at least a bit easier for a developer: jsoup to the rescue!&lt;/p&gt;
&lt;/div&gt;</description>
    </item>
    <item>
      <title>jq Snippets</title>
      <link>https://www.hascode.com/jq-snippets/</link>
      <pubDate>Mon, 01 Mar 2010 00:00:00 +0100</pubDate>
      <guid>https://www.hascode.com/jq-snippets/</guid>
      <description>&lt;div class=&#34;sect1&#34;&gt;
&lt;h2 id=&#34;_sample_json_file&#34;&gt;Sample JSON File&lt;/h2&gt;
&lt;div class=&#34;sectionbody&#34;&gt;
&lt;div class=&#34;paragraph&#34;&gt;
&lt;p&gt;A sample JSON file containing well-known programmers, used in the following examples:&lt;/p&gt;
&lt;/div&gt;
&lt;div class=&#34;listingblock&#34;&gt;
&lt;div class=&#34;title&#34;&gt;coders.json&lt;/div&gt;
&lt;div class=&#34;content&#34;&gt;
&lt;pre class=&#34;highlight&#34;&gt;&lt;code class=&#34;language-json&#34; data-lang=&#34;json&#34;&gt;[
  {
    &amp;#34;name&amp;#34;: &amp;#34;Bjarne Stroustrup&amp;#34;,
    &amp;#34;languages&amp;#34;: [
      {
        &amp;#34;name&amp;#34;: &amp;#34;C++&amp;#34;,
        &amp;#34;year_created&amp;#34;: 1983
      }
    ],
    &amp;#34;details&amp;#34;: {
      &amp;#34;nationality&amp;#34;: &amp;#34;Danish&amp;#34;,
      &amp;#34;awards&amp;#34;: [&amp;#34;IEEE Computer Society Computer Pioneer Award&amp;#34;, &amp;#34;Charles Stark Draper Prize&amp;#34;]
    }
  },
  {
    &amp;#34;name&amp;#34;: &amp;#34;Guido van Rossum&amp;#34;,
    &amp;#34;languages&amp;#34;: [
      {
        &amp;#34;name&amp;#34;: &amp;#34;Python&amp;#34;,
        &amp;#34;year_created&amp;#34;: 1991
      }
    ],
    &amp;#34;details&amp;#34;: {
      &amp;#34;nationality&amp;#34;: &amp;#34;Dutch&amp;#34;,
      &amp;#34;awards&amp;#34;: [&amp;#34;Free Software Foundation Award for the Advancement of Free Software&amp;#34;, &amp;#34;NLUUG Award&amp;#34;]
    }
  },
  {
    &amp;#34;name&amp;#34;: &amp;#34;James Gosling&amp;#34;,
    &amp;#34;languages&amp;#34;: [
      {
        &amp;#34;name&amp;#34;: &amp;#34;Java&amp;#34;,
        &amp;#34;year_created&amp;#34;: 1995
      }
    ],
    &amp;#34;details&amp;#34;: {
      &amp;#34;nationality&amp;#34;: &amp;#34;Canadian&amp;#34;,
      &amp;#34;awards&amp;#34;: [&amp;#34;Order of Canada&amp;#34;, &amp;#34;The Economist Innovation Award&amp;#34;]
    }
  },
  {
    &amp;#34;name&amp;#34;: &amp;#34;Dennis Ritchie&amp;#34;,
    &amp;#34;languages&amp;#34;: [
      {
        &amp;#34;name&amp;#34;: &amp;#34;C&amp;#34;,
        &amp;#34;year_created&amp;#34;: 1972
      },
      {
        &amp;#34;name&amp;#34;: &amp;#34;Unix&amp;#34;,
        &amp;#34;year_created&amp;#34;: 1969
      }
    ],
    &amp;#34;details&amp;#34;: {
      &amp;#34;nationality&amp;#34;: &amp;#34;American&amp;#34;,
      &amp;#34;awards&amp;#34;: [&amp;#34;Turing Award&amp;#34;, &amp;#34;National Medal of Technology&amp;#34;]
    }
  },
  {
    &amp;#34;name&amp;#34;: &amp;#34;Brendan Eich&amp;#34;,
    &amp;#34;languages&amp;#34;: [
      {
        &amp;#34;name&amp;#34;: &amp;#34;JavaScript&amp;#34;,
        &amp;#34;year_created&amp;#34;: 1995
      }
    ],
    &amp;#34;details&amp;#34;: {
      &amp;#34;nationality&amp;#34;: &amp;#34;American&amp;#34;,
      &amp;#34;awards&amp;#34;: [&amp;#34;Webby Award&amp;#34;]
    }
  }
]&lt;/code&gt;&lt;/pre&gt;
&lt;/div&gt;</description>
    </item>
  </channel>
</rss>
