Jhowell, if you have a minute I'd appreciate some hints. Below are the first few lines of code for a Spanish dictionary with look-up and below it a Catalan dictionary that does not support look-up. Do you see a clear way to convert one into the other? (I can write some code for that) Thanks!
Code:
# look-up dictionary
<mbp:pagebreak/>
<a id="filepos63413" />
<mbp:frameset>
<idx:entry scriptable="yes">
<idx:orth value="a-">
<idx:infl>
<idx:iform name="" value="a-"/>
<idx:iform name="" value="a"/>
</idx:infl>
</idx:orth>
<div><span><b>a-</b></span></div><span>a-<sup>1</sup>. Del lat. <span><i>ad-</i></span>.</span><div><blockquote align="left"><span><b>I. </b></span><span><i>Morfema</i></span><div><blockquote align="left"><span ><span>pref.</span> — <span>Interviene, sin significación precisa, en la formación de algunos derivados. <span><i>Abaratar, acomplejar, acortar, afirmar, amansar.</i></span></span></span></blockquote></div></blockquote></div></idx:entry><hr></hr>
Code:
# book dictionary
<h2 class="centrat2" id="aid-F8901">A</h2>
<blockquote class="calibre27">
<p class="rf">->a<sup class="calibre32">1</sup></p>
<p class="df"><code class="calibre22"><sup class="calibre23">■</sup><strong class="calibre13">a</strong></code><sup class="calibre23">1</sup></p>
<p class="ps">Hom.: <strong class="calibre13">ah</strong></p>
<p class="p">[<em class="v">pl</em> <em class="calibre24">as</em>] <em class="v">f</em> <strong class="n">1</strong> <span class="v1">ESCR</span> Nom de la primera lletra de l’alfabet catalŕ, <em class="ex">a A</em>.</p>
<p class="p"><strong class="n">2</strong> <strong class="calibre13">no saber ni la a</strong> No saber ni els rudiments d’una cosa.</p>
</blockquote>