Not able to query lots of strings in RDF(TheSession) #9

candlecao · 2024-09-08T01:34:39Z

For example, you cannot query the session named "Hurley’s Irish Pub" with:

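PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX wdt: <http://www.wikidata.org/prop/direct/>

SELECT ?session
WHERE {
  ?session wdt:P2561 "Hurley’s Irish Pub" .
  ?session rdf:type <https://thesession.org/sessions> .
}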

But the query works if you add "@en": ?session wdt:P2561 "Hurley’s Irish Pub"@en .
This is because of the modification that added language tags to the strings.
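
To illustrate why the untagged query finds nothing, here is a minimal sketch in plain Python. It assumes the store now holds the name as "Hurley’s Irish Pub"@en, and it models an RDF literal as a lexical form plus an optional language tag. The `Literal` class is a toy stand-in invented for this illustration, not the API of any RDF library.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Literal:
    """Toy model of an RDF literal (hypothetical, for illustration only)."""
    lexical: str
    lang: Optional[str] = None  # None means a plain, untagged string


# Assumption: this is what the store holds after the tagging change.
stored = Literal("Hurley’s Irish Pub", "en")

untagged_query = Literal("Hurley’s Irish Pub")      # the failing query's literal
tagged_query = Literal("Hurley’s Irish Pub", "en")  # the working query's literal

print(stored == untagged_query)  # False: the language tag is part of the term
print(stored == tagged_query)    # True
```

A triple pattern matches terms exactly, so a literal without a tag and the same text with @en are different terms even though the text is identical.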


candlecao · 2024-09-08T01:44:23Z

I don't quite agree with this rendering because:
(1) We cannot guarantee that all of these strings are actually in English.
(2) It will place an extra burden on LLM2SPARQL, making its output less accurate.
(3) We can treat English as the default language so that there is no need to specify it; for other languages, we can add tags such as @zh for Chinese, @fr for French...

@fujinaga Hi, Ich, do you agree?

Yueqiao12Zhang · 2024-09-13T14:40:04Z

@fujinaga

fujinaga · 2024-09-13T14:57:04Z

There should always be a language tag on every string. We can always instruct ChatGPT to append the language tags in SPARQL queries.
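
As an alternative or a safety net to prompting, a generated query could be post-processed so that untagged string literals get a default tag. The following is only a rough sketch under several assumptions: the helper name `add_default_lang` is hypothetical, only double-quoted literals are handled, and strings inside FILTER functions such as REGEX would be tagged too, which is usually not wanted.

```python
import re

# A double-quoted literal, optionally followed by a language-tag marker (@)
# or a datatype marker (^^). Single-quoted and triple-quoted forms are ignored.
_LITERAL = re.compile(r'("(?:[^"\\]|\\.)*")(@|\^\^)?')


def add_default_lang(query: str, lang: str = "en") -> str:
    """Append @lang to every double-quoted literal that has no tag or datatype."""
    def _tag(match: "re.Match[str]") -> str:
        if match.group(2):  # already tagged or typed: leave it untouched
            return match.group(0)
        return f"{match.group(1)}@{lang}"

    return _LITERAL.sub(_tag, query)


pattern = '?session wdt:P2561 "Hurley’s Irish Pub" .'
print(add_default_lang(pattern))
# ?session wdt:P2561 "Hurley’s Irish Pub"@en .
```

Literals that already carry a tag or a datatype are matched as a whole and returned unchanged, so running the function twice does not double-tag anything.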

Yueqiao12Zhang · 2024-09-13T15:01:35Z

Ok. Does this mean that I have to automatically detect the language of every string in my script?

fujinaga · 2024-09-13T15:50:56Z

No. For each database we import, we should know which language it's in.
For now you can always default to @en. If we are storing chant text from CantusDB, that would be in Latin.
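
One way the per-database default could look in an import script is sketched below. The mapping `DEFAULT_LANG` and the helper `tagged_literal` are hypothetical names for illustration; "la" is the language subtag for Latin, and unknown databases fall back to "en" as suggested above.

```python
# Hypothetical mapping from source database to its default language tag.
DEFAULT_LANG = {
    "TheSession": "en",  # current default for imported strings
    "CantusDB": "la",    # chant text is in Latin
}


def tagged_literal(text: str, database: str) -> str:
    """Return text as a Turtle-style literal tagged with the database default."""
    lang = DEFAULT_LANG.get(database, "en")  # fall back to English for now
    escaped = (
        text.replace("\\", "\\\\")  # escape backslashes first
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"@{lang}'


print(tagged_literal("Hurley’s Irish Pub", "TheSession"))
print(tagged_literal("Alleluia", "CantusDB"))
```

Keeping the language decision in one table means no per-string language detection is needed; adding a new database only requires one new entry.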

ahankinson · 2024-09-16T10:10:15Z

There are several codes that you can use for non-coded languages:

Type: script
Subtag: Zyyy
Description: Code for undetermined script
Added: 2005-10-16
%%
Type: script
Subtag: Zzzz
Description: Code for uncoded script
Added: 2005-10-16
%%
Type: language
Subtag: und
Description: Undetermined
Added: 2005-10-16
Scope: special

https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry

Note: "und should not be used unless a language tag is required and language information is not available or cannot be determined. Omitting the language tag (where permitted) is preferred. This subtag may also be useful when matching language tags in certain situations. Where xml:lang="" is allowed by the markup, it is better to use that rather than und"

From a search for "und" here: https://r12a.github.io/app-subtags/

See: https://www.w3.org/International/questions/qa-no-language#undetermined
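
The guidance quoted above could be applied roughly as follows. This is a sketch only: `format_literal` and its `tag_required` flag are hypothetical, and the function simply encodes "use the known language, otherwise omit the tag where permitted, otherwise fall back to und".

```python
from typing import Optional


def format_literal(text: str, lang: Optional[str] = None,
                   tag_required: bool = False) -> str:
    """Format a Turtle-style string literal following the 'und' guidance."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    if lang:
        return f'"{escaped}"@{lang}'  # language is known: tag it
    if tag_required:
        return f'"{escaped}"@und'     # a tag is mandatory but the language is unknown
    return f'"{escaped}"'             # preferred when omitting the tag is allowed


print(format_literal("Hurley’s Irish Pub", "en"))
print(format_literal("Hurley’s Irish Pub"))
print(format_literal("Hurley’s Irish Pub", tag_required=True))
```

Under a policy where every string must carry a tag, `tag_required=True` would be the relevant case for strings whose language cannot be determined.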

candlecao · 2024-09-19T23:36:44Z

Thank you, @ahankinson. Could you please give me some concrete examples, with explanation, that use such tags in RDF?

ahankinson · 2024-09-20T06:26:24Z

> Could you please give me some vivid examples plus explanation, which incorporate some tag in RDF

No, because you can use Google as well as I can. :-)

candlecao added the "priority: high" label Sep 8, 2024

candlecao assigned candlecao and Yueqiao12Zhang Sep 8, 2024

Yueqiao12Zhang mentioned this issue Sep 13, 2024

Not all types of literals are specified in RDF, and strings do not have language tags DDMAL/linkedmusic-datalake#164

Closed

candlecao added the "medium" label and removed the "priority: high" label Oct 3, 2024