How to organize a large number of objects
- by shane
We have a large number of documents and metadata (xml files) associated with these documents. What is the best way to organize them?
Currently we put them into a series of nested folders:
/repository/category/date(when they were loaded into our db)/document_number.pdf and .xml
We use the path as a unique identifier for the document in our system.
This is more versatile than putting them all in a single flat folder. also it is independent from our database/application, so we can reload them in case of failure.
Yet, it introduces some limitations. for example we can't move the files once they've been placed in this structure, also it takes work to put them this way.
What is the best practice? How websites such as Scribd deal with this problem?