Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
…2038)
* add accepted file formats
* Update data-formats.md
* update sipi path
* rename data-formats.md to file-formats.md
* update index.md
* fix footnote
* reset scala setting
irinaschubert committed Apr 12, 2022
1 parent 521150f, commit f72e7a0
Showing 7 changed files with 34 additions and 36 deletions.
This file was deleted.
@@ -0,0 +1,24 @@
<!---
 * Copyright © 2021 - 2022 Swiss National Data and Service Center for the Humanities and/or DaSCH Service Platform contributors.
 * SPDX-License-Identifier: Apache-2.0
-->

# File Formats in DSP-API

Currently, only a limited number of file formats are accepted for upload to DSP. Some metadata is extracted from the files during ingest, but the file formats are not validated. Only image files are currently converted to another format; both the converted version and the original file are kept.

The following table shows the accepted file formats:

| Category              | Accepted format           | Converted during ingest?                                                   |
| --------------------- | ------------------------- | -------------------------------------------------------------------------- |
| Text, XML<sup>1</sup> | TXT, XML, XSL, XSD        | No                                                                         |
| Tables                | CSV, XLS, XLSX            | No                                                                         |
| 2D Images             | JPEG, PNG, TIFF, JP2      | Yes, converted to JPEG 2000 by [Sipi](https://github.com/dasch-swiss/sipi) |
| Audio                 | MPEG (MP3), MP4, WAV      | No                                                                         |
| Video                 | MP4                       | No                                                                         |
| Office                | PDF, DOC, DOCX, PPT, PPTX | No                                                                         |
| Archives              | ZIP, TAR, ISO, GZIP, 7Z   | No                                                                         |


1: If your XML files represent text with markup (e.g. [TEI/XML](http://www.tei-c.org/)),
the recommended approach is to allow Knora to store it as [Standoff/RDF](standoff-rdf.md).
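
The table above can be read as a lookup from file extension to category and conversion behaviour. The following is a minimal sketch of that reading in Python; it is not taken from DSP-API or Sipi. The names `ACCEPTED_FORMATS` and `classify` are hypothetical, and the extension spellings are an assumption: they are simply the lowercase format names from the table, so common variants such as `.jpg`, `.tif` or `.gz` are not covered.

```python
from pathlib import Path

# Categories and formats copied from the table above.
# Assumption: each format name, lowercased, is used as the file extension.
# The boolean records whether the table says the file is converted during ingest.
ACCEPTED_FORMATS = {
    "Text, XML": ({"txt", "xml", "xsl", "xsd"}, False),
    "Tables": ({"csv", "xls", "xlsx"}, False),
    "2D Images": ({"jpeg", "png", "tiff", "jp2"}, True),
    "Audio": ({"mp3", "mp4", "wav"}, False),
    "Video": ({"mp4"}, False),
    "Office": ({"pdf", "doc", "docx", "ppt", "pptx"}, False),
    "Archives": ({"zip", "tar", "iso", "gzip", "7z"}, False),
}


def classify(filename):
    """Return (category, converted_during_ingest) pairs matching the extension.

    An empty list means the extension is not in the table. The check uses
    the file name only; like the ingest described above, it does not
    validate the file's actual content.
    """
    ext = Path(filename).suffix.lower().lstrip(".")
    return [
        (category, converted)
        for category, (extensions, converted) in ACCEPTED_FORMATS.items()
        if ext in extensions
    ]
```

Because MP4 is listed under both Audio and Video, `classify` returns a list rather than a single category; a caller would need some other signal to pick between the two.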