Both individuals and companies that work with huge volumes of documents recognize that comparing their content is a rather monotonous and meticulous task.
To tackle this very challenge, we created a special automated solution: Intellexer Document Comparator.
Intellexer Document Comparator
Intellexer Document Comparator is basically a tool that accurately compares text or URLs on two levels: lexical and conceptual, and sets a degree of similarity between them. What’s more, it can spot differences in the texts.
The tool extracts named entities and concepts, groups them into different categories: “Location”, “Person”, “Organization”, and many more. Then it compares them and decides which ones occur in both texts and which ones differ.
How it works
A user provides two links or two texts. Intellexer Comparator analyzes the input info and converts it into a vector form. In this new representation, specific words and phrases (initially used as the termbase for the documents) acquire a generalized structure and meaning. Weights are assigned to the meanings of the words (e.g. the weight of the subject is more than that of a simple noun group). As a result, the processing of information takes place at the level of the possible meanings of each word and at the level of the ideas that each sentence and the context, in general, may express.
Intellexer Comparator is a handy tool that collates two texts and determines the degree of proximity between the ideas they express. The proximity is indicated within the range of 0-100%, where “0” means "absolutely different texts" and “100” means "the same text".
Effective comparison is ensured by latent semantic analysis (based on comparison of syntactic and semantic relations), pattern-based text analysis, statistical analysis, and irrelevant word/pattern filtering.
You can get Intellexer Comparator in three different ways suitable for your unique needs: Demo (to get a taste of the system’s capabilities), SDK, and API.
The demo is free and can be found here: https://demo.intellexer.com/document_comparator_demo
If you want to experience all options the tool provides, you can subscribe to Intellexer API and get Comparator as its integrative part — along with Summarizer, Named Entities Recognizer, and others. You can use it for personal, non-commercial purposes, or even embed it in your application.
SDK variant is a perfect fit for developers creating programs and applications. SDK can become an integral part of plagiarism searching programs, news portals, video hosting sites, categorization systems, and many more.
There are many other programs on the market dealing with text comparison (WinMerge, SmartSynchronize, Meld, and others). In contrast, Intellexer Comparator has a number of useful advantages.
Most importantly, our system operates at the level of meaning and ideas, not only syntax, as competitors. This makes the comparison result more accurate and won’t consider sentences written in different words but having the same meaning.
Intellexer Comparator can process URLs and extract concepts among other features.
Calculating texts and concepts similarity percentage makes it easier for a user to perceive the result, they immediately understand where identical and similar texts are and vice versa.
Equally significant for a user is that our demo is free of charge and no registration is needed.
More value awaits
We have a ton of exciting updates planned that you’ll definitely like. We’ll make our products even more intelligent and rich in functionality. Check back soon!
March 12, 2022Back to Blog Main Page
Application based on Intellexer API that performs: