barlowrussell committed Jan 3, 2025
2 parents 9958c7f + bf7e379 commit e3549fd
Showing 12 changed files with 3,828 additions and 323 deletions.
1 change: 0 additions & 1 deletion FORMS.md
@@ -8,7 +8,6 @@ The value-to-form processing is divided into two steps, implemented as methods:
- `FormSpec.clean`: Normalizes a form chunk.

These methods use the attributes of a `FormSpec` instance to configure their behaviour.

- `brackets`: `{'(': ')'}`
Pairs of strings that should be recognized as brackets, specified as `dict` mapping opening string to closing string
- `separators`: `(';', '/', ',')`
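
As a rough illustration of how the `brackets` and `separators` attributes listed above could drive splitting and cleaning, here is a minimal standalone sketch. It is not the `FormSpec` implementation: the helper names `strip_brackets` and `split_value` are hypothetical, the input strings are invented, and the sketch assumes single-character, non-nested brackets.

```python
# Illustrative only: a standalone sketch, not the FormSpec implementation.
BRACKETS = {"(": ")"}          # opening string -> closing string, as listed above
SEPARATORS = (";", "/", ",")   # strings that delimit form chunks, as listed above


def strip_brackets(text, brackets=BRACKETS):
    """Drop bracketed material (assumes single-character, non-nested brackets)."""
    out, closer = [], None
    for ch in text:
        if closer is None:
            if ch in brackets:
                closer = brackets[ch]   # start skipping until the closing string
            else:
                out.append(ch)
        elif ch == closer:
            closer = None               # bracket closed, resume copying
    return "".join(out)


def split_value(value, separators=SEPARATORS, brackets=BRACKETS):
    """Split a raw value on separators that occur outside brackets."""
    chunks, current, closer = [], [], None
    for ch in value:
        if closer is None and ch in separators:
            chunks.append("".join(current))
            current = []
            continue
        if closer is None and ch in brackets:
            closer = brackets[ch]       # separators are ignored until this closes
        elif ch == closer:
            closer = None
        current.append(ch)
    chunks.append("".join(current))
    # Clean each chunk and discard the ones that end up empty.
    cleaned = (strip_brackets(chunk, brackets).strip() for chunk in chunks)
    return [chunk for chunk in cleaned if chunk]


# Invented input, purely for demonstration.
forms = split_value("abc (note); def, ghi")
```

Under these assumptions a separator inside brackets does not split the value, so `"a (b, c)/d"` yields two chunks rather than three.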
3 changes: 3 additions & 0 deletions NOTES.md
@@ -0,0 +1,3 @@
### How to contribute

If you have data on numerals of the Pacific that you would like to contribute to this dataset, you can! Please notify us of the data by opening an [issue](../../issues/). If the numeral information is already published somewhere, you can simply provide the bibliographic information for the publication. If you have a digital copy of the publication (or of the relevant pages), you can also attach that in the issue. If, however, the data are not published anywhere (e.g., they are from your personal field notes), then you can make a pdf including the data and as much metadata as possible: the name of the language, the date and location of where the data were collected, an "author" for the document (e.g., the name of the linguist who collected and compiled the data), and a title for the document (e.g., "Cardinal numerals in Language X" or "Excerpt from Dr. Linguist's field notes on Language X"). The goal is to have an "unpublished manuscript" that is nevertheless nicely citable.
23 changes: 13 additions & 10 deletions README.md
@@ -1,4 +1,4 @@
![image](https://github.com/user-attachments/assets/565bd7c5-1db3-421f-961c-5d66a0f799fe)# CLDF dataset derived from Barlow's "Numerals of the Pacific" from 2024
# CLDF dataset derived from Barlow's "Numerals of the Pacific" from 2024

[![CLDF validation](https://github.com/numeralbank/barlowpacific/workflows/CLDF-validation/badge.svg)](https://github.com/numeralbank/barlowpacific/actions?query=workflow%3ACLDF-validation)

@@ -9,15 +9,18 @@ If you use these data please cite
> Barlow, Russell (ed.). 2024. Numerals of the Pacific: A collection of numeral terms in Austronesian and Papuan languages.
- the derived dataset using the DOI of the [particular released version](../../releases/) you were using

## How to contribute
## Description

If you have data on numerals of the Pacific that you would like to contribute to this dataset, you can! Please notify us of the data by opening an [issue](../../issues/). If the numeral information is already published somewhere, you can simply provide the bibliographic information for the publication. If you have a digital copy of the publication (or of the relevant pages), you can also attach that in the issue. If, however, the data are not published anywhere (e.g., they are from your personal field notes), then you can make a pdf including the data and as much metadata as possible: the name of the language, the date and location of where the data were collected, an "author" for the document (e.g., the name of the linguist who collected and compiled the data), and a title for the document (e.g., "Cardinal numerals in Language X" or "Excerpt from Dr. Linguist's field notes on Language X"). The goal is to have an "unpublished manuscript" that is nevertheless nicely citable.

This dataset is licensed under a CC-BY-4.0 license

## Description
## Notes

### How to contribute

If you have data on numerals of the Pacific that you would like to contribute to this dataset, you can! Please notify us of the data by opening an [issue](../../issues/). If the numeral information is already published somewhere, you can simply provide the bibliographic information for the publication. If you have a digital copy of the publication (or of the relevant pages), you can also attach that in the issue. If, however, the data are not published anywhere (e.g., they are from your personal field notes), then you can make a pdf including the data and as much metadata as possible: the name of the language, the date and location of where the data were collected, an "author" for the document (e.g., the name of the linguist who collected and compiled the data), and a title for the document (e.g., "Cardinal numerals in Language X" or "Excerpt from Dr. Linguist's field notes on Language X"). The goal is to have an "unpublished manuscript" that is nevertheless nicely citable.


This dataset is licensed under a CC-BY-4.0 license

## Statistics

@@ -27,10 +30,10 @@ This dataset is licensed under a CC-BY-4.0 license
![Concepticon: 100%](https://img.shields.io/badge/Concepticon-100%25-brightgreen.svg "Concepticon: 100%")
![Source: 100%](https://img.shields.io/badge/Source-100%25-brightgreen.svg "Source: 100%")

- **Varieties:** 2,783 (linked to 1,401 different Glottocodes)
- **Concepts:** 212 (linked to 123 different Concepticon concept sets)
- **Lexemes:** 37,092
- **Sources:** 444
- **Varieties:** 3,058 (linked to 1,475 different Glottocodes)
- **Concepts:** 235 (linked to 128 different Concepticon concept sets)
- **Lexemes:** 39,765
- **Sources:** 502
- **Synonymy:** 1.09

## Possible Improvements:
@@ -54,4 +57,4 @@ Christoph Rzymski | @chrzyki | patron, maintainer | Other

The following CLDF datasets are available in [cldf](cldf):

- CLDF [Wordlist](https://github.com/cldf/cldf/tree/master/modules/Wordlist) at [cldf/cldf-metadata.json](cldf/cldf-metadata.json)
- CLDF [Wordlist](https://github.com/cldf/cldf/tree/master/modules/Wordlist) at [cldf/cldf-metadata.json](cldf/cldf-metadata.json)
10 changes: 5 additions & 5 deletions cldf/README.md
@@ -12,8 +12,8 @@ property | value
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF Wordlist](http://cldf.clld.org/v1.0/terms.rdf#Wordlist)
[dc:license](http://purl.org/dc/terms/license) | https://creativecommons.org/licenses/by/4.0/
[dcat:accessURL](http://www.w3.org/ns/dcat#accessURL) | https://github.com/numeralbank/barlowpacific
[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/numeralbank/barlowpacific/tree/f408638">numeralbank/barlowpacific v1.5-17-gf408638</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v5.0">Glottolog v5.0</a></li><li><a href="https://github.com/concepticon/concepticon-data/tree/v3.2.0">Concepticon v3.2.0</a></li><li><a href="https://github.com/cldf-clts/clts/tree/v2.3.0">CLTS v2.3.0</a></li></ol>
[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) | <ol><li><strong>lingpy-rcParams</strong>: <a href="./lingpy-rcParams.json">lingpy-rcParams.json</a></li><li><strong>python</strong>: 3.12.5</li><li><strong>python-packages</strong>: <a href="./requirements.txt">requirements.txt</a></li></ol>
[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/numeralbank/barlowpacific/tree/d9ff71c">numeralbank/barlowpacific v1.6-128-gd9ff71c</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v5.1">Glottolog v5.1</a></li><li><a href="https://github.com/concepticon/concepticon-data/tree/v3.2.0">Concepticon v3.2.0</a></li><li><a href="https://github.com/cldf-clts/clts/tree/v2.3.0">CLTS v2.3.0</a></li></ol>
[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) | <ol><li><strong>lingpy-rcParams</strong>: <a href="./lingpy-rcParams.json">lingpy-rcParams.json</a></li><li><strong>python</strong>: 3.13.1</li><li><strong>python-packages</strong>: <a href="./requirements.txt">requirements.txt</a></li></ol>
[rdf:ID](http://www.w3.org/1999/02/22-rdf-syntax-ns#ID) | barlowpacific
[rdf:type](http://www.w3.org/1999/02/22-rdf-syntax-ns#type) | http://www.w3.org/ns/dcat#Distribution

@@ -23,7 +23,7 @@ property | value
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF FormTable](http://cldf.clld.org/v1.0/terms.rdf#FormTable)
[dc:extent](http://purl.org/dc/terms/extent) | 37092
[dc:extent](http://purl.org/dc/terms/extent) | 39765


### Columns
@@ -50,7 +50,7 @@ Name/Property | Datatype | Description
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF LanguageTable](http://cldf.clld.org/v1.0/terms.rdf#LanguageTable)
[dc:extent](http://purl.org/dc/terms/extent) | 2783
[dc:extent](http://purl.org/dc/terms/extent) | 3059


### Columns
@@ -75,7 +75,7 @@ Name/Property | Datatype | Description
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF ParameterTable](http://cldf.clld.org/v1.0/terms.rdf#ParameterTable)
[dc:extent](http://purl.org/dc/terms/extent) | 212
[dc:extent](http://purl.org/dc/terms/extent) | 235


### Columns
12 changes: 6 additions & 6 deletions cldf/cldf-metadata.json
@@ -13,13 +13,13 @@
{
"rdf:about": "https://github.com/numeralbank/barlowpacific",
"rdf:type": "prov:Entity",
"dc:created": "v1.5-17-gf408638",
"dc:created": "v1.6-128-gd9ff71c",
"dc:title": "Repository"
},
{
"rdf:about": "https://github.com/glottolog/glottolog",
"rdf:type": "prov:Entity",
"dc:created": "v5.0",
"dc:created": "v5.1",
"dc:title": "Glottolog"
},
{
@@ -42,7 +42,7 @@
},
{
"dc:title": "python",
"dc:description": "3.12.5"
"dc:description": "3.13.1"
},
{
"dc:title": "python-packages",
@@ -54,7 +54,7 @@
"tables": [
{
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#FormTable",
"dc:extent": 37092,
"dc:extent": 39765,
"tableSchema": {
"columns": [
{
@@ -161,7 +161,7 @@
},
{
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#LanguageTable",
"dc:extent": 2783,
"dc:extent": 3059,
"tableSchema": {
"columns": [
{
@@ -239,7 +239,7 @@
},
{
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#ParameterTable",
"dc:extent": 212,
"dc:extent": 235,
"tableSchema": {
"columns": [
{
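
The `dc:extent` values changed in this file are per-table row counts, so one way to review them is to collect them by table type. The following is a minimal sketch under stated assumptions: `table_extents` is a hypothetical helper, and `SAMPLE` is a trimmed-down stand-in that contains only the keys and new values visible in the diff, not the full `cldf/cldf-metadata.json`.

```python
import json

# Trimmed-down stand-in for cldf/cldf-metadata.json (assumption: only the keys
# visible in the diff are reproduced, with the new values from this commit).
SAMPLE = json.loads("""
{
  "tables": [
    {"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#FormTable", "dc:extent": 39765},
    {"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#LanguageTable", "dc:extent": 3059},
    {"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#ParameterTable", "dc:extent": 235}
  ]
}
""")


def table_extents(metadata):
    """Map each table's type (the URL fragment of dc:conformsTo) to its dc:extent."""
    extents = {}
    for table in metadata.get("tables", []):
        conforms_to = table.get("dc:conformsTo", "")
        name = conforms_to.rsplit("#", 1)[-1]   # e.g. "FormTable"
        extents[name] = table.get("dc:extent")
    return extents


extents = table_extents(SAMPLE)
for name, count in extents.items():
    print(f"{name}: {count}")
```

On a local checkout, the same helper could presumably be applied to the result of `json.load` on the real metadata file instead of `SAMPLE`.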