The PanLex database has undergone several incremental design revisions. Among the major ones have been:
- The addition of language varieties to what were only languages.
- The elaboration of the records of sources to include bibliographic, file-type, editor, language-variety, quality, complexity, and other attributes.
- The consolidation of meaning identifiers, word classifications, domain specifications, and metadata into meaning and denotation classifications and properties.
- The creation of source groups.
- The addition of mutability and expressional default names to language varieties.
We expect the database design to continue to undergo occasional changes, and we welcome proposals for improvements.
Potential changes to the database design that are under discussion include:
Source classifications and properties
Above we discuss classifications and properties in general. Additional details specific to source classifications and source properties are discussed here. As of June 02016, these changed had not been implemented.
An idea emerged in May 02015 to reorganize the structure of source records, making them more extensible, better able to accommodate multiple values per column, and more interoperable and/or linkable with other bibliographic records. One possible benefit would be to permit records to be imported from elsewhere, such as OCLC, Library of Congress (possibly under the Z39.50 standard), and Open Library.
This proposal has been accommodated with source classifications and properties. They generalize most of the content of table
Source classifications are stored in table
acs. It has the structure
acs serial primary key ap integer not null ex0 integer ex1 integer not null
acs absorbed the
li column of table
ap (sources). For each record of table
ap having a non-null value of
li, a record was created in table
ex0 is the expression “license” in the DCMI Metadata Terms language variety
art-301, and the 2-character value of
li was converted to the ID of an equivalent expression in either the SPDX License List Identifier language variety
art-298 or the PanLex Intellectual Property Licenses language variety
art-299 as the value of
Source properties are stored in table
app. It has the structure:
app serial primary key ap integer not null ex integer not null sq smallint not null tt text not null
app absorbed columns
ad of table
ap (sources). Their
ex values are the IDs of equivalent expressions in the DCMI Metadata Terms language variety,
art-301. or the PanLex Source Classes and Properties language variety,
The sq column of table app identifies a sequential index. Sources often have multiple authors, and PanLex sources often have multiple titles. Some also have multiple publishers, multiple URLs, etc. Column
sq specifies the order in which they appear.
ap contains each source’s ID, registration date, estimated quality, and group ID. You can use source classifications and properties to record other information about sources. This includes the usual bibliographic facts and PanLex-specific data.
We are not referring here to facts about the editorial assimilation of sources, such as the completion status of assimilation and problems facing the editors of particular sources. These editorial facts are stored in the
aped table instead.