Sanskrit Newspaper - Sanskrit Dictionary integration using Camel
1) spin up a tomcat container running camel (AWS - EC2)
2) pull from articles in http://sudharma.epapertoday.com/
3) event driven pull from dictionary http://spokensanskrit.de/
4) results merged with editorial content on http://pradyumnsharma.blogspot.in/
The camel ETL component also looks promising, so I worked through the example in the Camel download. Be careful that your later version may no longer set the entity manager in the header of the camel exchange -- I simply grabbed the entity manager factory from the exchange via the registry and it worked fine.
Playing around with this example (which uses JPA to consume records as files and insert/update them into a database) naturally led me to consume instead from a webpage (sudharma) -- merely substituting the file component with the http one. WSnotification might be what is under the hood for the camel http endpoint. I ended up making the following changes to get it to work:
Final comfy solution resulted in creating a new component: webpage.