Linking data with PanLex

Introduction

The PanLex team welcomes and cooperates with developers who plan to link PanLex with other data.

Current links

The current status of PanLex in the recognized cloud of linked open linguistic data is visible in the LLOD diagram. To locate PanLex in the diagram, search the page for its name.

The first project to create links between PanLex and other data was designed by Patrick Westphal and Claus Stadler in 02012. This work is demonstrated in the PanLex RDF interface. Stadler and Westphal gave a presentation on linked data and PanLex2RDF at the PanLex 02013 internship program.

Links within PanLex

The PanLex database design provides for links to other data, in the form of table columns whose values tie each record to a corresponding item in an external dataset or standard. These columns include:

  • Column lv.lc, whose values are ISO 639 alpha-3 codes.
  • Columns lc.gl, lc.glat, and lc.glon, whose values are Glottocodes, latitudes, and longitudes of languages according to Glottolog.
  • Column ex.tt, whose values are identifiers defined by published standards when column ex.lv has certain values (that is, for the expressions of certain language varieties).

An example of this last type is language variety 9635, whose UID is art-294 and whose expressions’ tt values are the dotted-numeric identifiers of a standard titled Semantic Domains, originated by Ron Moe and published by SIL International. One of the expressions in this language variety has the tt value “5.5.5”, and it expresses the domain of things that fires produce (sparks, ashes, smoke, heat, etc.).
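
The example above can be sketched as a lookup against a small in-memory database. This is only an illustration under stated assumptions, not the actual PanLex schema: the text names the columns lv.lc, ex.lv, and ex.tt, while the key columns lv.lv and ex.ex, the variety-code column lv.vc, and the helper functions uid and external_ids are hypothetical. The sketch also assumes that a UID is the language code followed by a hyphen and a three-digit variety code, which is consistent with art-294 but is not stated in the text.

```python
import sqlite3

# Simplified, hypothetical stand-ins for the PanLex tables lv and ex.
# Only lv.lc, ex.lv, and ex.tt come from the text; lv.lv, lv.vc, and
# ex.ex are assumptions made for this sketch.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE lv (lv INTEGER PRIMARY KEY, lc TEXT, vc INTEGER);
    CREATE TABLE ex (ex INTEGER PRIMARY KEY,
                     lv INTEGER REFERENCES lv(lv),
                     tt TEXT);
""")

# The single language variety and expression mentioned in the text.
conn.execute("INSERT INTO lv (lv, lc, vc) VALUES (9635, 'art', 294)")
conn.execute("INSERT INTO ex (lv, tt) VALUES (9635, '5.5.5')")


def uid(conn, lv_id):
    """Build a UID from lc and vc (assumed format: lc-NNN)."""
    lc, vc = conn.execute(
        "SELECT lc, vc FROM lv WHERE lv = ?", (lv_id,)
    ).fetchone()
    return f"{lc}-{vc:03d}"


def external_ids(conn, lv_id):
    """Return the tt values of all expressions in one language variety.

    For a variety such as 9635, these tt values double as identifiers
    in an external standard (here, Semantic Domains).
    """
    rows = conn.execute(
        "SELECT tt FROM ex WHERE lv = ? ORDER BY tt", (lv_id,)
    )
    return [tt for (tt,) in rows]


print(uid(conn, 9635), external_ids(conn, 9635))
```

A consumer of the data could then treat each returned tt value as a key into the external standard, for instance to look up the Semantic Domains entry numbered 5.5.5.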

Linked data in PanLex

Some sources of data in PanLex are themselves linked to other data. It may be possible to create links to that data through those sources by using meaning identifiers.

Numerous PanLex sources belong to the Apertium family, which has been establishing links to other data.