Wikidata:Requests for permissions/Bot/K10PlusBot
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved--Ymblanter (talk) 20:02, 11 November 2020 (UTC)[reply]
K10PlusBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: JakobVoss (talk • contribs • logs)
Task/s: The bot User:K10PlusBot will look up Wikidata items with ISBN, query these ISBN in K10plus library union catalog (K10plus (Q100884768)) and add the K10plus record identifier ("PPN") if a unique record wit this ISBN was found.
Code: A first implementation of the bot is available at https://github.com/gbv/K10PlusBot as simple shell script. A more elaborated extension might be created to improve performance (cache which ISBN has not been found so don't query the same ISBN again) without changing the program logic.
Function details: The bot will check around 43300 existing statements and later run regularly to check ISBN numbers newly added to Wikidata. A rough estimate of statements to be added is 10000.
--JakobVoss (talk) 21:18, 29 October 2020 (UTC)[reply]
- i basically like your idea to improve the linkage between Wikidata and library catalogs especially for books. but i am at the moment a little bit afraid of adding a lot of additional data to items about "books", unless there isn't an appropriate and wide accepted data model for books in Wikidata. I really prefer and like the simplified frbr-like way of work -> edition/manifestation -> item (if necessary). isbn and the different identifiers from union catalogs (as k10plus or the one i am looking for OBV edition ID (P6201) are typically - as you know - identifiers for the edition[expression/manifestation]-level. but on wikidata the are many simplified "book" items or even work-items containing an isbn.
- so my question is, how would you deal with the following item:
- Fear (Q543773)ISBN-13 (P212)3-596-10494-7 → K10Plus gives you 3 k10plus-IDs. Each k10plus-ID represents an own edition of Fear (Q543773), so there were need for a lot of clean up. (Remove isbn from the work-item, create new edition items, add the isbn and the different k10plus-ids and so on.)
- maybe to avoid a lot of work and to avoid adding a lot of edition/manifestion-data on work-level-items, the main sparql-query for your bot could be reduced only the edition-items:
- Try it!
SELECT ?qid ?isbn { ?qid wdt:P957 ?isbn ; wdt:P31 wd:Q3331189. FILTER( STRSTARTS( ?isbn, "$PREFIX" ) ) . FILTER NOT EXISTS { ?qid wdt:P6721 ?ppn } } LIMIT 10
- --Mfchris84 (talk) 20:56, 29 October 2020 (UTC)[reply]
- Thanks for feedback, I modified the query as suggested! -- JakobVoss (talk) 21:46, 29 October 2020 (UTC)[reply]
- I am going to approve the bot in a couple of days provided no objections have been raised.--Ymblanter (talk) 19:38, 9 November 2020 (UTC)[reply]
- Thanks for feedback, I modified the query as suggested! -- JakobVoss (talk) 21:46, 29 October 2020 (UTC)[reply]