File Kind
License: PRO+ (Professional Edition or higher)
Plugin Type: Index Plugin
Author: Diskover Data, Inc.
Overview
The File Kind plugin automatically categorizes files into logical groups based on their file extensions during Diskover indexing. Instead of searching through hundreds of individual file extensions, you can simply search for "Video" or "Document" to find what you need.
The plugin adds a filekind field to each indexed file, making it easy to:
Generate storage reports by file category
Build targeted cleanup policies for specific file types
Create Smart Searches for quick access to common file groups
Understand your storage composition at a glance
Use Cases
Storage Administrators
Generate reports showing storage consumption by file type to identify what's consuming disk space
Target specific categories (e.g., Trash, Compressed) for archival or deletion policies
Monitor growth trends across different file categories
Compliance & Governance Teams
Track document distribution across storage systems for GDPR and data classification initiatives
Identify and classify data sets based on file type categories
Support data retention policy enforcement
Media & Creative Teams
Quickly locate video, audio, and image assets across storage
Manage media asset libraries with category-based organization
Track creative file distribution across projects
Development Teams
Analyze code file distribution across projects
Monitor programming and application file sprawl
Identify temporary and system files consuming developer storage
Understanding File Categories
The File Kind plugin works by matching file extensions to predefined categories. When a file is indexed, the plugin examines its extension and assigns it to the appropriate category. Files without extensions or with unrecognized extensions are assigned to "Other."
Default File Categories
The plugin includes 12 predefined categories covering common file types:
Category | Extensions |
|---|---|
Document | doc, docx, odt, pdf, tex, wpd, wks, txt, rtf, key, odp, pps, ppt, pptx, ods, xls, xlsm, xlsx |
Image | ai, bmp, gif, ico, jpeg, jpg, png, ps, psd, psp, svg, tif, tiff, exr, tga |
Video | 3g2, 3gp, avi, flv, h264, m4v, mkv, qt, mov, mp4, mpg, mpeg, rm, swf, vob, wmv, ogg, ogv, webm |
Audio | au, aif, aiff, cda, mid, midi, mp3, m4a, mpa, ogg, wav, wma, wpl |
Application | apk, exe, bat, bin, cgi, pl, gadget, com, jar, msi, py, wsf |
Programming | c, cgi, pl, class, cpp, cs, h, java, php, py, sh, swift, vb |
Internet | asp, aspx, cer, cfm, cgi, pl, css, htm, html, js, jsp, part, php, py, rss, xhtml |
System | bak, cab, cfg, cpl, cur, dll, dmp, drv, icns, ico, ini, lnk, msi, sys, tmp, vdi, raw |
Data | csv, dat, db, dbf, log, mdb, sav, sql, tar, xml |
Disc | bin, dmg, iso, toast, vcd, img |
Compressed | 7z, arj, deb, pkg, rar, rpm, tar, gz, z, zip |
Trash | old, trash, tmp, temp, junk, recycle, delete, deleteme, clean, remove |
How Categories Are Assigned
The plugin processes files using the following logic:
Extracts the file extension (case-insensitive)
Searches through configured categories for a match
Assigns the first matching category found
Defaults to "Other" if no match or no extension exists
Requirements
Python Dependencies
This plugin has no external Python dependencies. All required functionality is provided by Python's standard library, making it lightweight and fast.
System Requirements
Python 3.9 or higher
Diskover indexer with plugin support enabled
Installation
Step 1: Configure the Plugin
Navigate to Diskover Admin Panel > Plugins > Index Plugins > File Kind
Enable the plugin
Review and customize the file categories if needed (see Configuration section)
Save the configuration
Step 2: Enable in Index Task Configuration
Navigate to Diskover > Configurations > select your configuration (e.g., Default)
Scroll to the bottom to find Index Plugins Enablement
Enable the File Kind plugin
Save the configuration
The plugin will now run automatically during any scan using this configuration.
Configuration
Configuration Parameters
Parameter | Type | Default | Description |
|---|---|---|---|
filekinds | Dictionary | See default categories above | Dictionary mapping category names to lists of file extensions |
Creating Custom Categories
You can customize the file categories to match your organization's needs. This is particularly useful for industry-specific file types that aren't covered by the defaults.
To add a new category:
Add a new entry to the filekinds configuration with your category name and associated extensions:
'CAD': ['dwg', 'dxf', 'step', 'stp', 'iges', 'igs', 'stl', 'obj', 'fbx', '3ds']
To modify an existing category:
Update the extension list for any existing category to add or remove file types.
Industry-Specific Examples
Life Sciences Configuration
'Genomics': ['fastq', 'fasta', 'fa', 'fq', 'bam', 'sam', 'vcf', 'bed', 'gff', 'gtf'] 'Microscopy': ['nd2', 'czi', 'lif', 'oib', 'oif', 'ims', 'vsi'] 'Analysis': ['r', 'rmd', 'ipynb', 'py', 'mat']
Media & Entertainment Configuration
'Video': ['mov', 'mp4', 'mxf', 'avi', 'mkv', 'prores', 'dnxhd'] 'Image': ['exr', 'dpx', 'tiff', 'png', 'jpg', 'psd', 'ai'] 'Audio': ['wav', 'aiff', 'mp3', 'aac', 'flac'] 'Project': ['aep', 'prproj', 'nk', 'hip', 'blend', 'ma', 'mb']
Architecture & Engineering Configuration
'CAD': ['dwg', 'dxf', 'rvt', 'rfa', 'ifc', 'pln', 'skp'] '3DModels': ['step', 'stp', 'iges', 'stl', 'obj', '3ds', 'fbx'] 'GIS': ['shp', 'geojson', 'kml', 'kmz', 'gpx', 'gdb']
Indexed Fields
Elasticsearch Field Mapping
Field Path | ES Type | Description |
|---|---|---|
filekind | keyword | The category label assigned to the file based on its extension |
Example Document
{
"name": "quarterly_report.pdf",
"extension": "pdf",
"filekind": "Document"
}
{
"name": "project_render.mov",
"extension": "mov",
"filekind": "Video"
}
Searching in Diskover
The filekind field enables simple, intuitive searches by file category.
Basic Search Examples
Query | Description |
|---|---|
| Find all document files |
| Find all video files |
| Find all image files |
| Find all compressed archives |
| Find potential cleanup candidates |
| Find files with unrecognized extensions |
Building Smart Searches and Reports
One of the most powerful ways to leverage the File Kind plugin is by creating Smart Searches and Reports in Diskover. This allows you and your team to quickly access commonly needed file category views without typing queries each time.
Smart Searches
Create saved Smart Searches for frequently accessed file categories:
All Media Files — Combine Video, Audio, and Image categories
Cleanup Candidates — Target Trash and Compressed categories for storage reclamation
Code Review — Find Programming files modified recently
Document Audit — Locate all documents for compliance review
Reports
Build scheduled reports using File Kind classifications to:
Track storage consumption trends by file category over time
Identify growth in specific file types (e.g., video storage increasing)
Monitor cleanup policy effectiveness by tracking Trash category reduction
Support compliance reporting with document distribution analysis
Troubleshooting
Issue | Cause | Solution |
|---|---|---|
Plugin not loading | Plugin not properly installed or enabled | Verify plugin is enabled in both Plugin configuration and Index Task Configuration |
Files showing wrong category | Extension exists in multiple categories | First matching category wins; reorder categories or move extension to preferred category |
All files showing "Other" | Configuration not loaded properly | Restart Diskover Admin after configuration changes; verify configuration syntax |
New category not appearing | Configuration changes not saved | Save configuration and re-run a scan for changes to take effect |
Custom extensions not recognized | Extension not added correctly | Ensure extensions are lowercase and without the leading dot |
Debug Logging
To enable verbose logging for troubleshooting:
Linux:
Check logs at /var/log/diskover/diskover.log
Windows:
Check Diskover service logs or your configured log location
Support
Last Updated: January 2026
Diskover Data, Inc.
Comments
0 comments
Please sign in to leave a comment.