A new version of the corpus used by the KITAB team is now available for download from Zenodo, an Open Science platform that supports Open Access. This is the second release developed by the OpenITI organization, and it is also accessible on GitHub. The current release features 7,119 books, counting all versions and editions (1,464,011,669 words), of which 4,285 are unique works written by 1,833 authors. Among these, 446 books are in OpenITI mARkdown. The project team has also corrected the book metadata. Major corrections are listed in the release notes, which also provide statistics on the corpus and a list of its current and past contributors.
Users will be able to read the texts in any text reader and learn to code using thousands of Arabic works.