I propose a software that monitors all files in the "Program Files" and "Documents and Settings" folders of XP based computers. The program would prevent multiple versions of the same document unless the user specifically instructed otherwise.
1) If a person tried to save a different version of the same document, the software would automatically assign a version number to the document and link the two so the user and administrators can track, archive, and/or trash outdated versions of documents.
2) Regardless of the filename, all files would have the data compared and notated in a list. Administrators could then either delete such duplication, coordinate a common name, and/or flag the document for revision control.
3) For common documents that formally require revision control, the administrator would have the option to change a duplicate found on the system to a read-only property and co-locate a link to the currently approved version of the document.
4) The software would provide statistics related to the amount of duplication in the system.
Add your comment
Revision/version control system. Two common systems are CVS and SVG, but they're more for programming code maitnence than anything else. However, I've heard of them being used for document handling.
There are some document files I would like duplicated on separate hard drives as I have found hard drives to fail over time. I would prefer not to delegate to software the possibility to destroy invaluable material.
For drive redundancy, raid-1 is very useful. If you are looking to actually backup files on a periodic basis, rsync works well.
No, that is not what this software is for; but I do appreciate the response.
Between my wife and I we obtain, modify, and produce hundreds of documents a year. In the hectic day to day life we often will take one document that is somewhat close to what we are going to create and then use it as a template to build another document. Often times we will decide to go a different route but that partially revised document still exists on our computers.
We have RAID and we use OneCare to backup our files. However, over the years we have many hundreds of files that just aren't necessary anymore but they are mixed in with files we do want. So our file directories are broad and deep.
What we want is to stop the duplication in our future efforts, and be able to wean out the duplication that has already occurred.
Sand, that's a good point. The software would have to be configurable to allow for synchronization of backups. Either by other software, or resident within the proposed software. Certain directories for instance might be declared as protected or "off limits".
I reccomend getting a system of legitimate MS Office templates made that you can use. CVS is for code revisioning, and rollbacks. You "check out" a document, similar to a library, modify it, and put it back in the system. The system looks at what changed, and makes both versions of the file available. As the new version changes, you can roll back alterations, or merge two versions together. I've never tried using this with something like a word document, so I don't know how well it would work. It works very well for flat text.
There are also dedicated document management systems, but those cost a lot of money.
toastydeath, Regarding comparing two Word Documents.
MicroSoft Word has a manual mechanism to compare 2 documents and allow merging individual differences in either direction. I suppose I could create a macro to automate doing more than one document using that function. But I guess would be more useful is to do a document compare automatically and just tell me by what percentage the 2 documents are the same and display only those pairs that have a certain percentage or higher of duplication (user selectable). That way I can cleanup the worst offenders first.