Utilizing a Multimodal Doc ML Mannequin to Question Your Paperwork | by Eivind Kjosbakken | Apr, 2024
Leverage the ability of the mPLUG-Owl doc understanding mannequin to ask questions on your paperwork
This text will talk about the Alibaba document understanding model, lately launched with mannequin weights and datasets. It’s a highly effective mannequin able to performing numerous duties similar to doc query answering, extracting info, and doc embedding, making it a useful device when working with paperwork. This text will implement the mannequin domestically and try it out on completely different duties to offer an opinion on its efficiency and usefulness.
· Motivation
· Tasks
· Running the model locally
· Testing of the model
∘ Data
∘ Testing the first, leftmost receipt:
∘ Testing the second, rightmost receipt:
∘ Testing the first, leftmost lecture note:
∘ Testing the second, rightmost lecture note
· My thoughts on the model
· Conclusion