Getvisibility discovers unstructured data and documents across an organisation’s estate. Unstructured data usually refers to information that does not reside in a traditional row-column database and is usual in the form of documents and emails.


It is estimated that up to 90% of the data in any organization is unstructured in this form. Additionally, the amount of unstructured data can rapidly increase over time.


With Getvisibility, the discovery process is simple with the platform being pointed to the various locations and granted access permissions. After a short installation and configuration process, organisations can point the platform to various data sources either individually, or as part of a phased roll-out, or all at once for company-wide discovery.


Getvisibility can discover unstructured data from various sources such as file shares, cloud platforms (AWS, Azure), and discover over 70 file types including but not limited to PDFs, all Microsoft documents, mark-up formats, and so forth.


With state-of-the-art machine learning algorithms, Getvisibility combines natural language processing with neural networks and optical character recognition. This allows us to classify unstructured data across organizations with unparalleled accuracy and speed without affecting the file servers during the scan.


Using machine learning rather than traditional pattern matching (regular expressions) and dictionary lookup methods allows Getvisibility to understand the context of a document, thereby increasing accuracy.


As the neural network does the majority of the work, organisations no longer have to embark on the laborious and expensive task of creating rules and regex hits per department and document type.


Getvisibility’s customizable tag set enables users to apply company-specific classification to their unstructured data, which the neural network learns with increasing accuracy. Training of the neural network can be done through our user-friendly interface, eliminating the need for the highly qualified engineers and data scientists associated with traditional methods.


The graph below is taken from our production deployments and illustrates an accuracy improvement of approximately 7% per hour of retraining – an optional but a powerful method supported by our software. Companies can also opt to only upgrade the system monthly and not spend time on any retraining and still see improvements thanks to the machine learning.



You cannot protect what you do not understand. Through accurate classification of unstructured data, Getvisibility provides organisations with an in-depth view of their unstructured data landscape.


Risk scores are also calculated per document assessing the confidentiality, permissions, context and content of a document. This enables organisations to make informed decisions surrounding their data security posture and focus on data at risk.


Getvisibility’s in-depth reports allow companies to monitor the impact of their security initiatives and assess their competency and success.


Getvisibility also integrates with multiple Data Loss Prevention (DLP) platforms such as Microsoft AIP and Forcepoint. These integrations allow organisations to automatically apply DLP tags to unstructured data, depending on their classification and content. This greatly accelerates the data loss prevention project, while reducing cost and increasing the efficacy.