Release 5.6.0 Toggle Dark/Light/Auto mode Toggle Dark/Light/Auto mode Toggle Dark/Light/Auto mode Back to homepage

File Types

Upload Container Files

The following container types are supported as both upload containers and for processing:

ExtensionKind of DocumentMedia/MIME Type
.gzGZip compressed archiveapplication/gzip
.bz2
.iso
.mbox
.pstMicrosoft Outlook .pst files 2000 / 2003 / 2007 / 2010 / 2013 / 2016 / 2019 / 2021 / Outlook 365
.rarRAR archiveapplication/vnd.rar
.tarTar archive
.vdi
.vhd
.vmdk
.zGNU-compressed files
.zipZIP archiveapplication/zip
.7z7-zip archiveapplication/x-7z-compressed
Canopy does not support multipart archive files, which include formats such as multipart PST and multipart ZIP files. If you have a multipart archive, please combine the parts into a single archive file before uploading.

Supported Files

The following file types are supported and tested by Canopy for processing. Canopy takes a best effort approach for processing files not contained on this list, therefore, supported files extend beyond this list:

ExtensionKind of DocumentMedia/MIME Type
.accdbMicrosoft Access 2007+
.bak, .mdfMicrosoft SQL
.bmpBitmapimage/bmp
.csvCSV (Comma-Separated Values)text/csv
.datDAT (DAT files)
.dcmDigital Imaging and Communications in Medicine (DICOM)
.docMicrosoft Wordapplication/msword
.docxMicrosoft Word XML Formatapplication/vnd.openxmlformats-officedocument.wordprocessingml.document
.dotMicrosoft Word Templateapplication/msword
.dotmMicrosoft Document Macro Enabled Templateapplication/vnd.ms-word.template.macroenabled.12
.dotxMicrosoft Document XML Templateapplication/vnd.openxmlformats-officedocument.wordprocessingml.template
.emfEnhanced Metafilesimage/x-emf
.emlEmailsmessage/rfc822
.gifGraphics Interchange Format (GIF)image/gif
.gzGZip compressed archiveapplication/gzip
.heif/.heicHigh Efficiency File Format (HEIF) Familyimage/heic; image/heic-sequence; image/heif; image/heif-sequence
.iso
.jpg, jpeg, jpeJoint Photographic Experts Group (JPEG)image/jpeg
.keyApple iWorks Keynoteapplications/x-iwork-keynote-sffkey
.mbox
.mdbMicrosoft Access Format < 2007
.msgMicrosoft Message File
.numbersApple iWorks Numbers < version 12applications/x-iwork-numbers-sffnumbers
.pagesApple iWorks Pagesapplications/x-iwork-pages-sffpages
.pdfAdobe Portable Document Formatapplication/pdf
.pngPortable Network Graphics (PNG)image/png
.potMicrosoft PowerPoint Templateapplication/vnd.ms-powerpoint
.potmMicrosoft PowerPoint Macro Enabled Templateapplication/vnd.ms-powerpoint.template.macroenabled.12
.potxMicrosoft PowerPoint XML Templateapplication/vnd.openxmlformats-officedocument.presentationml.template
.ppsMicrosoft PowerPoint Slide Showapplication/vnd.ms-powerpoint
.ppsmMicrosoft PowerPoint Macro Enabled Slide Showapplication/vnd.ms-powerpoint.slideshow.macroenabled.12
.ppsxMicrosoft PowerPoint XML Slide Showapplication/vnd.openxmlformats-officedocument.presentationml.slideshow
.pptMicrosoft PowerPointapplication/vnd.ms-powerpoint
.pptmMicrosoft PowerPoint Macro Enabledapplication/vnd.ms-powerpoint.presentation.macroenabled.12
.pptxMicrosoft PowerPoint XML Formatapplication/vnd.openxmlformats-officedocument.presentationml.presentation
.pstMicrosoft Outlook .pst files 2000 / 2003 / 2007 / 2010 / 2013 / 2016 / 2019 / 2021 / Outlook 365
.psvPSV (Pipe-Separated Values)text/plain; charset=ISO-8859-1
.rarRAR archiveapplication/vnd.rar
.rtfRich Text Format (Text files)application/rtf; text/richtext
.sas7bdatStatistical Analysis System (SAS) Databaseapplication/x-sas-data
.tiffTag Image File Formatimage/tiff
.tsv, .tabTSV / TAB (Tab-Separated Values)text/tab-separated-values; charset=ISO-8859-1
.txtTXT (Text files)text/plain
.vdi
.vhd
.vmdk
.webpWeb Pictureimage/webp
.wpdWordPerfect 6application/vnd.wordperfect; version=6.x
.wmfWindows Metafilesimage/x-wmf
.w51WordPerfect 5.1application/vnd.wordperfect; version=5.1
.xlaMicrosoft Excel Add-Insapplication/vnd.ms-excel
.xlamMicrosoft Excel Macro-Enabledapplication/vnd.ms-excel.addin.macroenabled.12
.xlsMicrosoft Excelapplication/vnd.ms-excel
.xlsbMicrosoft Excel Binary Macro Enabledapplication/vnd.ms-excel.sheet.binary.macroenabled.12
.xlsxMicrosoft Excel XML Formatapplication/vnd.openxmlformats-officedocument.spreadsheetml.sheet
.xltMicrosoft Excel Templateapplication/vnd.ms-excel
.xltxMicrosoft Excel XML Templateapplication/vnd.openxmlformats-officedocument.spreadsheetml.sheet
.xlwMicrosoft Excel Workspaceapplication/vnd.ms-excel
.zGNU-compressed files
.zipZIP archiveapplication/zip
.7z7-zip archiveapplication/x-7z-compressed

Unsupported Files

The following unsupported files will force fail during processing:

File typeExtensions
Database.sqlite
Mailbox Files.ost, .nsf
Spreadsheets.numbers version >= 12
Presentations.ppam
Images.svg
Zeiss CSI images.czi
Aperio SVS images.svs
Aperion fluorescent images.afi

Skipped Files

Once the processing pipeline determines a file matching one of the following Media Types, the file will be marked as skipped and removed from further processing.

Currently, processing skips the following files by media type:

Media/MIME Types/SignatureNon Exhaustive Extension ListKind of Document
application/atom+xml.atomAtom Syndication Format
application/epub+zip.epubElectronic publication (EPUB)
application/font-sfnt.ttf, .otf, .ttcTrueType or OpenType
application/geotopic.xml, .rdf, .json, .html, .csvISO/TS 19139-1:2019 Geographic Information
application/java-vm.classJava Class File
application/octet-stream.binUninterpreted binary
application/pkcs7-signature
application/pkcs7-mime
.p7c, .p7z, .p7s, p7mPKCS #7 digital signatures and certificates
application/rss+xml.rss, .xml, .rdfRDF Site Summary (RSS)
application/step.st, .step, .stpISO-10303 STEP data
application/timestamped-data.tsdTimeStampedData
application/vnd.ms-fontobject.eotEmbedded OpenType (EOT)
application/vnd.ms-htmlhelp.chmMicrosoft Compiled HTML Help (CHM)
application/x-dosexec.exeDOS/Windows executable (EXE)
application/x-elf.elfExecutable and Linkable Format (ELF)
application/x-font-adobe-metric.afm, .amfm, .acfmAdobe Multiple Font Metrics Format Files
application/x-font-ttf.ttfTrueType Font
application/x-font-type1.pfa, .pfbPostScript Type 1 Fonts
application/x-hdf.hdf, .he5, .h5Hierarchical Data Format File
application/x-matlab-data.matMATLAB Files
application/x-msdownload.exe, .dll, .ocx, .msi, .msp, .cab, .bat, .com, .scrPortable Executable (PE)
application/x-msdownload; format=pe32.exe, .dll, .ocx, .sysPortable Executable (PE) format for 32-bit Windows
application/x-msdownload; format=pe64exe, .dll, .ocx, .sysPortable Executable (PE) format for 64-bit Windows
application/x-netcdf.nc, .cdfNetwork Common Data Form
application/x-object.o, .obj, .coffObject code files
application/x-sharedlib.so, .dllShared library files
Files starting with a tilde (~) that is followed by a dollar sign ($) and of one of these MIME Types:
application/msword,
application/vnd.openxmlformats-officedocument.wordprocessingml.document,
application/vnd.openxmlformats-officedocument.spreadsheetml.sheet,
application/msexcel,
application/vnd.ms-powerpoint,
application/vnd.openxmlformats-officedocument.presentationml.presentation,
application/pdf
.doc, .docx, .xls, .xlsx, .ppt, .pptx, .pdfWindow Temporary Files, also known as “owner file”
multipart/appledoubleApple Double Resource Files
Exactly “Thumbs.db” in OLE Compound File FormatThumbs.dbHidden Temporary Windows Folder/Directory File
text/x-c++.cpp, .cc, .cxx, .c++, .h, .hpp, .hxx, .hhC++ Source OCde and Header Files
text/x-c++src.cpp, .cxx, .cc, .C, .c++, .CPPC++ Source Code

Currently, processing skips the following files by extension:

  • .out
  • .pack
  • .pbxproj
  • .abcdp
  • .xcuserstate

In older projects, once the processing pipeline determines a file has one of the following extensions, the file will be marked as skipped and removed from further processing:

  • .afm
  • .atom
  • .axf
  • .bin
  • .c++
  • .cc
  • .chm
  • .class
  • .cpp
  • .cxx
  • .dat
  • .dll
  • .elf
  • .eot
  • .epub
  • .exe
  • .fb2
  • .fbz
  • .geot
  • .hdf
  • .ibooks
  • .iso
  • .ko
  • .mat
  • .mod
  • .nc
  • .o
  • .p7c
  • .p7m
  • .p7s
  • .pfa
  • .pfb
  • .prx
  • .puff
  • .rss
  • .sfnt
  • .so
  • .tsd
  • .ttf
  • .woff
  • .woff2

File Types Filtering

Canopy processes and displays all file types, even those not listed in our standard documentation above.

The File Type field in the Filter UI uses a unique value search filter type. When you click on the filter, you’ll see a dropdown list of every unique file type detected in your data.

  • 2,000 or Fewer Unique File Types: Each file type will be listed individually for easy selection.

less_than_2000.png

  • 2,000+ Unique File Types: To keep the browsing experience smooth and the Filter UI easy to navigate, Canopy groups file types into two categories: Standard and Non-Standard. more_than_2000.png more_than_2000_2.png
    • Standard File Types include all file types listed in the Supported, Unsupported, or Skipped categories above. These file types are listed individually in the Filter UI.
    • Non-Standard File Types include all file types not covered in the standard list. These are grouped together in the Filter UI. non_standard.png
      • To find a specific Non-Standard File Type, users must enter the exact file extension (e.g., .xyz) in the search box.
      • Users can filter multiple Non-Standard File Types at once by searching and selecting multiple extensions from the filter search results. non_standard_search.png non_standard_search_result.png