Most of the aforementioned 102 duplicate cleaners are not "movie savvy". The bytes may differ, even the picture quality itself slightly, but video-aware software will detect the similarity, almost as a human movie watcher would. Media files need a computer program that can understand the video, not the raw bytes stored in them. What we need therefore is software that can find movie content duplicates, allowing for differences in HD resolution and file formats ( WMV,AVI,MOV,ASX,MKV,MP4,ASF,MTS,TTS.). It is the same movie, but the file sizes and durations vary slightly, one is widescreen HD and the other 720p - still they represent the same movie! Take for instance the following 2 video files. This is next to impossible for movies, where there is no universally accepted tagging system (like ID3 tags for MP3 music). Some simpler duplicate files software ignore the audio/video content and try to find similar movies using file attributes like the name and duration. Such visual or audio algorithms can be used to compare movies, and see through differences in video resolutions, frame rates and dimensions, soundtrack loudness etc, to identify visually identical files - as opposed to binary-identical. Some programs compare a succession of thumbnails (frames) using similar image detection methods, others focus on the soundtrack and use audio fingerprints to align and compare the audio streams. Recognizing movies calls for specialized software that can decompress and extract the video and audio streams from the movie file, then use a number of algorithms to compare them and discover similar movies. So a movie that you or me would instantly recognize, is hard to tell looking at the raw bytes stored in the file. Each container allows multiple resolutions (720p, FullHD, UHD etc) for increased movie quality (and byte size!). Video is encoded compressed in various formats (along with its audio streams), in containers such as Matroska MKV, the old Audio Video Interleave (AVI) file, MPEG-4 (think MP3 but for video) and many more. This is fast and safe, but will not work very well for your movie file collection. You just examine all the files in your hard disk byte-for-byte, and discover all 100% identical files. The technology behind duplicate file software is quite simple, almost trivial. How do you discover near-duplicate video? We believe that the data are representative, almost scientific :) You are welcome to try your own movie library for duplicates - you should find the same results more or less, relatively speaking. The most popular shareware programs that claim to deal with near-duplicate movie files are included, and their performance is measured on a fixed set of video files (movie collection library). Clearly the author is biased, but this review is objective.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |