Being a freelance developer has taught me several lessons for delivering salable and adaptive .NET software solutions for my clients.
Lately I have been working on a generic client server application which basically deals with extracting some key metadata from "PDF" images.
I say pdf image because that is what the product are - some handwritten documents that are scanned but cannot be converted to readable text using OCR.
- These document can be found in many place - Historical Archives,Libraries, Schools and Offices.
- This project was aimed to convert about 150 handwritten books, very large format. A2, A3 sizes.
- Books has as much as 1000 pages on average.
- on Average there was about 300 to 400 individual chapters per book.
- These were historical document, and each chapter was it, own story, it own book. Thus a lot of pre processing to extract the document into their individual chapters.
My Job was to build a quick application to manage this archiving process - a software that will display the document as pdf to the end user on a large 20 Inch LCD monitor, and also display a sidebar for the metadata.
See the concept in figure 1.0
Developing a file metadata system -Part 1.
Please continue following this blog. Developing a file metadata system -Part 2
Comments
Post a Comment