Till Grallert
@tillgrallert@digitalcourage.social  ·  activity timestamp last month

This week I have been digging around in the #InternetArchive looking for digitised Arabic periodicals. With a bit of #Rstats and far too many hours of #XSLT and #TEI/XML spent identifying titles from the very patchy metadata provided by uploaders, I have made quite a few exciting finds.

The API returned some 4500+ items for the keywords "جريدة", "مجلة", and "صحيفة", of which I could identify 781 as pertaining to 100 individual Arabic periodicals published before 1930. Links to all of these have been uploaded to #Wikidata, which will increase their visibility to scholars and the interested public.

There are, of course, thousands of items for which I couldn’t programmatically establish a title with sufficient certainty, let alone try and link them to existing records without actually looking at the digital facsimile and reading the information provided on front-pages and mastheads.

Check out https://query-chest.toolforge.org/redirect/wnywBGxzyyiukWWYiIQqi0smeUCu6AggKG4mOME68i3 to see the results.

#ArabPeriodicalStudies #PeriodicalStudies #DigitalHumanities

Till Grallert
@tillgrallert@digitalcourage.social replied  ·  activity timestamp 3 weeks ago

The diversity of metadata provided by users uploading items to @internetarchive caused me to (again) make the most common error in API calls: making them too specific. I had been too optimistic and limited my search to items explicitly marked as being in Arabic. Removing this condition returned another 5000+ items for Arabic periodicals based on the same keywords.
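
For readers who want to try something similar, here is a minimal sketch of a narrow versus a broad query. It is not the code used for this work (that was done in R). It assumes the Internet Archive's public `advancedsearch.php` endpoint, and the `language:Arabic` clause, the returned fields, and the helper names `build_query` and `search_url` are illustrative guesses rather than the conditions actually used.

```python
from urllib.parse import urlencode

# Public search endpoint of the Internet Archive (assumed here; the original
# work may have gone through a different client or endpoint).
ADVANCED_SEARCH = "https://archive.org/advancedsearch.php"

KEYWORDS = ["جريدة", "مجلة", "صحيفة"]


def build_query(keywords, language=None):
    """OR the quoted keywords together; optionally narrow by language."""
    terms = " OR ".join(f'"{k}"' for k in keywords)
    query = f"({terms})"
    if language is not None:
        # Hypothetical filter: uploaders fill in the language field
        # inconsistently, so this clause silently drops many relevant items.
        query += f" AND language:{language}"
    return query


def search_url(query, rows=100, page=1):
    """Build a JSON search URL asking for a few metadata fields per item."""
    params = [
        ("q", query),
        ("fl[]", "identifier"),
        ("fl[]", "title"),
        ("fl[]", "language"),
        ("rows", rows),
        ("page", page),
        ("output", "json"),
    ]
    return f"{ADVANCED_SEARCH}?{urlencode(params)}"


narrow = build_query(KEYWORDS, language="Arabic")  # the too-specific variant
broad = build_query(KEYWORDS)                      # keywords only

print(search_url(narrow))
print(search_url(broad))
```

The only difference between the two URLs is the extra `AND language:...` clause. Because the language field is free text supplied by uploaders, dropping it and filtering afterwards catches items that were never tagged or were tagged differently.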

When all was done, I was able to identify 2890 items (unique ARK IDs), or roughly one third of all items, and link them to 119 individual periodical titles on #Wikidata.
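
Counts like these can be tallied as in the rough sketch below, which assumes each harvested record has been reduced to an ARK ID plus a matched title (or `None` where no title could be established). The record layout, the `summarise` helper, and the sample values are all made up for illustration and are not real data from this project.

```python
from collections import defaultdict


def summarise(items):
    """Count unique items, identified items, and distinct titles.

    `items` is an iterable of dicts with the hypothetical keys
    'ark' (item ARK ID) and 'title' (matched title, or None).
    """
    all_arks = {it["ark"] for it in items}
    arks_by_title = defaultdict(set)
    for it in items:
        if it["title"] is not None:
            arks_by_title[it["title"]].add(it["ark"])
    identified = set()
    for arks in arks_by_title.values():
        identified |= arks
    return {
        "items": len(all_arks),
        "identified": len(identified),
        "titles": len(arks_by_title),
        "share": len(identified) / len(all_arks) if all_arks else 0.0,
    }


# Placeholder records, NOT real Internet Archive items.
sample = [
    {"ark": "ark:/00000/a1", "title": "Periodical A"},
    {"ark": "ark:/00000/a1", "title": "Periodical A"},  # same item returned twice
    {"ark": "ark:/00000/a2", "title": "Periodical B"},
    {"ark": "ark:/00000/a3", "title": None},            # title not established
]

result = summarise(sample)
print(result)
```

Deduplicating on the ARK ID before counting matters because the same item can come back from several keyword searches.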
