In the field of Digital Cultural Heritage the data produced and used will include the following 

 

 

Modules and components for the VI-SEEM scientific application environment

for individual operations, and docker containers for every module that can be shared and/or reused, e.g., 3D model cleaning, generating and streaming a 3D model, Interactive 3D Museum tours and web UI creation. Also OCR module, and Databases of complex datasets, e.g., handwritten Arabic, Hebrew and Karamanlidika texts - for searching and identifying of phonetic varieties of the indexed lexemes, as well as the finding of grammatical and formational suffixes..

 

Documentation and analysis datasets of structures, works of art and artefacts.

IWeb UI for new tools for modelling of geoelectrical tomographic data, subsurface reconstruction and imaging; datasets of CH artefacts; regional datasets like MEGA Jordan GPS/ geo-referenced data.

3D visualization and analysis

This category involves online visualization viewers for x3D, 3D pdf of image-based 3D reconstructions (Structure-from-Motion/photogrammetric techniques) and RTi ptm files, as well as MS Word and pdf files.

Metadata

of the Digital Cultural Heritage community will be mostly freely exchangeable and open, except from a few cases of copyrighted material - in particular ancient coins, rare books and unpublished Ptolemaic inscriptions. The VI-SEEM Cultural Heritage metadata shall follow the CIDOC-CRM RDF (Dioptra), ARC2 triple store, ISBD –M, and UNIMARC (BVL) standards, respectively. Metadata will mostly be generated from the operation of Digital Libraries, the application of semantic referencing and annotation, users' annotations of digitized artifacts and reconstructed historical objects, as well as from the publication of databases and use of OCR tools. Metadata standardization, e.g., Dublin Core and derived/related standards, XML, is important as it will allow for their mapping, e.g., Open CV and MINT, and interoperability across platforms, e.g., Spark SQL, ASCII.

Available Pre-Processed data

Description: Electronic Corpus of Karamanlidika Texts: The dataset aims at the electronical publication Karamanlidika texts in both the original Greek writing and Latin transcription and to provide the users with a database which allows the searching and identifiying of phonetic varieties of the indexed lexemes, as well as the finding of grammatical and formational suffixes. Thanks to the indexation of lexemes and morphemes, lexical compatibility can be as easily researched as morphological combinations.

Access dataset though repository  Access dataset through Clowder

NOTE: To access datasets through Clowder you need to be a registered user!

Source: ELKA - Karamanlidika Texts
Charge: Free
Processing Level: Processed Data
Use Licence: CC0 1.0 Universal
Contact: Matthias Kappler
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

Description: The Banatica database already contains 1000 bibliographic descriptions and 200 fulltext scanned books. The existing database aim is to facilitate the development and establishment of a virtual database of patrimonial books, accessible on both sides of the border for enabling more efficient cross-border cooperation in educational and research sector. The partners will develop together a common virtual database of rare patrimonial books which can be extended to other type of documents (postal cards, coins, maps) having Banat as a topic, focusing on promotion of common cultural heritage of cross border area, which aim to consolidate the scientific dimension of the intellectuals’ offspring of Banat Region

 Access dataset through repository Access dataset through Clowder

NOTE: To access datasets through Clowder you need to be a registered user!

Source: BVL - Banatica Database (200 Digitized Books)
Charge: Free
Processing Level: Processed Data
Use Licence: Creative Commons Attribution 4.0 International (CC BY 4.0)
Contact: Delia Pârșan
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

Description: Three-dimensional (3-D) inversion of surface Electrical Resistivity Tomography (ERT) data in order to automatically determine a 3-D resistivity subsurface model. The collection contains five datasets.

Access datagroup through repository Access datagroup through Clowder 

NOTE: To access datasets through Clowder you need to be a registered user!

Source: 3DINV
Charge: Free
Processing Level: Unprocessed Data
Use Licence: Creative Commons Attribution 4.0 International (CC BY 4.0)
Contact: Angelos Hliaoutakis
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

Description: AutoGR application is suited for large image datasets, such as the aerial photographs collected with UAVs or during systematic aerial surveys. The GRID system is going to speed up the georeferencing process. AutoGR can be used as online service for image georeferencing.

Access datagroup through repository Access datagroup through Clowder 

NOTE: To access datasets through Clowder you need to be a registered user!

Source: AutoGR
Charge: Free
Processing Level: Unprocessed Data
Use Licence: Creative Commons Attribution 4.0 International (CC BY 4.0)
Contact: Gianluca Cantoro
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

 

Note: To access this dataset you need to have authorised access to the data collection.

Description: The Historical Arabic Documents Dataset for Recognition Systems
Annotation on sub-word level of five books written by different writers from the years 1088-1451.

Access datagroup through Clowder

NOTE: To access datasets through Clowder you need to be a registered user!

Source: MANUSCRIPT
Charge: Free
Processing Level: Unprocessed Data
Use Licence: Creative Commons Attribution 4.0 International (CC BY 4.0)
Contact: Jihad El-Sana
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

 

Note: To access this dataset you need to have authorised access to the data collection.

Description: CNN Features for Remote Sensing Image Classification
Tools and resources for remote sensing image classification using convolutional neural networks (convnets).
Collection SAT CNN Models contains code and pretrained convnet models for classification of satellite images. The convnets are trained on publicly available SAT-4 and SAT-6 datasets of satellite images (http://csc.lsu.edu/~saikat/deepsat/). Collection contains the following datasets:
- SAT-4 Models - CNN models trained on SAT-4 dataset
- SAT-6 Models - CNN models trained on SAT-6 dataset
- Tools & Results - code for convnet training and image classification, conference paper and presentation

Access dataset through repository Access dataset through Clowder   

NOTE: To access datasets through Clowder you need to be a registered user!

Source: CNN Features for Remote Sensing Image Classification
Charge: Free
Processing Level: Unprocessed Data
Use Licence: Creative Commons Attribution 4.0 International (CC BY 4.0)
Contact: Vladimir Risojević
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it. 

Description: The Centre for the Study of Ancient Documents was established in 1995 under the auspices of Oxford University's Faculty of Literae Humaniores to provide a focus for the study of ancient documents within Oxford. The Centre provides a home for Oxford University's epigraphical archive, which includes one of the largest collections of squeezes (paper impressions) of Greek inscriptions in the world, together with the Haverfield archive of Roman inscriptions from Britain, and a substantial photographic collection. The strengths of the epigraphical archive lie in its broad coverage of early Greek inscriptions, Attic epigraphy and the Hellenistic world. Individual sites well represented in the archive include Chios, Samos, Priene, Rhodes, and Samothrace. The material in the archive is currently being reorganised and catalogued.

Access RTi datasets through repository Access RTi datasets through Clowder 

NOTE: To access datasets through Clowder you need to be a registered user!

Source: The Centre for the Study of Ancient Documents Archive, Oxford
Charge: Free
Processing Level: Unprocessed Data
Use Licence: Creative Commons Attribution 4.0 International (CC BY 4.0)
Contact: Kyriakos Savvopoulos
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it. 

Available Documentation and analysis data

Description: VISTA - Bibliotheca Alexandrina: Popularize the museum experience by adding many ways to simplify and present archaeological and historical data. By utilizing advance programming and graphics techniques, we aim to make a tool that complements and enriches the real museum, substitutes for the inability to visit it, adds new layers and means of storytelling, connects multiple museums worldwide, and that can be used remotely by anyone in the world. The datagroup consists of one dataset, two documents and two presentations.

Access datagroup through Clowder 

NOTE: To access datasets through Clowder you need to be a registered user!

Source: VISTA
Charge: Free
Processing Level: Unprocessed Data
Use Licence: Creative Commons Attribution 4.0 International (CC BY 4.0)
Contact: Mohammed Elfarargy
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

Available 3D visualization and analysis data

Description: Fourier Transform Infrared Spectra of two sets of samples: 1. Wall paintings 2. Marble Sculpture Scientific Contact: Maram Na'es, Technical University Berlin, Department of Optics and Atomic Physics/ and Synchrotron-light for Experimental Science and Applications in the Middle East (SESAME).

Access datagroup through repository  Access datagroup through Clowder

NOTE: To access datasets through Clowder you need to be a registered user!

Source: PETRA
Charge: Free
Processing Level: Processed Data
Contact: Maram Na'es
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.

Available Metadata

Description: The metadata schema developed by the Science and Technology for Archaeology Research Center (STARC) has the goal to enable data interoperability and access to the digital resources stored in the local repository. Its structure allows retrieving models, activities, decision and answers the research question on how data can be used for data interpretation and re-used to perform further analysis and post-processing of raw data. The datasets stored in STARC repository refer to 2D and 3D cultural heritage data including archaeological sites, museum objects and architectonic elements. The schema is mostly based on LIDO and CARARE ones and is CIDOC-CRM compliant. The novelty of this metadata schema is the subset of metadata that has been designed to allow recording the information about the provenance of the digital objects, a particularly important aspect when the objects are 3D digital replicas of cultural objects.

Access datagroup through Clowder 

NOTE: To access datasets through Clowder you need to be a registered user!

Source: STARC - Dioptra
Charge: Free
Contact: Valentina Vassallo
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.