Whitelists for even higher accuracy


Why show fewer matches?

We at PlagScan consider our goal to efficiently prevent plagiarism, so our algorithm is optimized for presenting concise reports that allow a quick judgement of the case.

Accordingly only relevant matches should be included – to account for this, users  can already set the sensitivity of plagiarism detection depending on the use case: Sensitivity regulates when PlagScan will report potential plagiarism. With high sensitivity text matches are reported even if they are short, as this could indicate plagiarism. On the other hand a figure of speech which is accidentally part of both texts could be reported as plagiarism. With low sensitivity you will receive fewer reported matches in turn, but these will most likely not be coincidental.

 

This is the result of our general experience, however there are individual specialties. For example if you have law texts you might not be interested in reporting this common sentence as plagiarism: All human beings are born free and equal in dignity and rights.

Here whitelists come into play.Settings: whitelist types

Types of whitelist filtering

Whitelist-URLs: By adding URLs to the whitelist you can exclude them from the plagiarism check, even if the corresponding web pages contain matching text. In each line you can enter a string. Each URL entailing this string is excluded from the plagiarism check. We recommend to be as specific as possible
Example: en.plagscan.com would mean that pages like
http://en.plagscan.com/faq or
http://www.plagiarismcheckertest.com/en.plagscan.com/ are excluded, while
http://plagscan.com/faq or
http://www.plagiarismcheckertest.com/plagscan/
would still be considered for the plagiarism check.

Whitelist-URLs+: If text from a URL listed here is found during a plagiarism check the match will be treated as a citation. This also holds for matches of the corresponding text from other sources. (For example if Wikipedia is on the list and sentence A is matched with Wikipedia and, say, 5 duplicates of Wikipedia which are not listed still sentence A is counted as citation.)

Whitelist-Phrases: Phrases which shall be ignored for plagiarism search. Enter one per line, without punctuation. Phrases are treated as case-insensitive.

Future plans: Learning

As always we already look at the next steps to improve even more. Whitelists are great but it can be tedious to maintain them manually. For the future we plan to mostly automate this task by learning globally and within the organizations.

We feel these features will bring us closer to the goal of bringing the most useful plagiarism reports to all PlagScan users!

There is a video tutorial about setting up your plagiarism checking. The whitelist feature is also included. Watch the video here.

To learn about specific use cases read The Whitelist – Common Use Cases.

 

 

 


Leave a comment

Your email address will not be published. Required fields are marked *

2 thoughts on “Whitelists for even higher accuracy