tag:blogger.com,1999:blog-8114624464462183402.post6234243393979187732..comments2023-08-28T17:59:13.699+02:00Comments on CSEARCH NMR-Database: Could someone explain to meWolfgang Robienhttp://www.blogger.com/profile/18294768167257474999noreply@blogger.comBlogger5125tag:blogger.com,1999:blog-8114624464462183402.post-3380124620466102692009-11-02T03:13:09.199+01:002009-11-02T03:13:09.199+01:00I thought the post made some good points on extrac...I thought the post made some good points on extracting data, For simple stuff i use python to get or simplify data,data extraction can be a time consuming process but for larger projects like files, the web, or documents i tried http://www.extractingdata.com which worked great, they build quick custom screen scrapers, extracting data, and data parsing programsatlas245https://www.blogger.com/profile/09312104162096523556noreply@blogger.comtag:blogger.com,1999:blog-8114624464462183402.post-36855510370307865042008-01-08T09:26:00.000+01:002008-01-08T09:26:00.000+01:00Thanks for commenting on the open issues, I think ...Thanks for commenting on the open issues, I think the situation has been clarified. My personal summary: There is good software-support to speed up data-extraction, a few, but decisive tasks are still missing - no wonder, the complete workflow is extremely complex and diverse between between different journals. I agree that some standardization in this field is highly desireable.<BR/><BR/>The picture of money-flow is very much simplified - you can negotiate that, as long as you stay within the legal boundaries. Be sure, the license fees for CSEARCH-algorithms and CSEARCH-data go back 100% into the project and therefore into science.Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-8114624464462183402.post-60980929250663397832008-01-08T08:17:00.000+01:002008-01-08T08:17:00.000+01:00Hi Wolfgang,those are impressive increases, really...Hi Wolfgang,<BR/><BR/>those are impressive increases, really. That's a major effort, and very important.<BR/><BR/>It might be worthwhile to go back to one of the reasons why I (and others) think Open Data is important.<BR/><BR/>Let me make clear this is about money. The problem is really not that those who have to enter and curate the data should do this for free. Of course not, that would be stupid.<BR/><BR/>However, the current money flow is the problem. Peter has blogged about this in the early days, and you might find those items interesting as further reading.<BR/><BR/>With proprietary databases the money flow is at first as it should be: money goes from the user to the author who enters the data. However, soon, after the authors have stopped entering new structures, money still flows from the users, but normally no longer to the authors, but to some software company or publishers, but not back into science.<BR/><BR/>And, that's what is worrying me. If that money would be spend of further helping science, it would be much less of a problem, I think. If the money flow, when you stop working on the CSEARCH database would still aid research, that is go to research, instead of managers, or account holders...<BR/><BR/>The second problem is simply being able to verify what things are doing. I don't like black boxes, which proprietary databases are. I find that rather unscientific, as *user*, not as database maintainer.<BR/><BR/>Anyway, *really* impressive numbers!<BR/><BR/>Areas where I think OSCAR3 is helpful:<BR/><BR/>- it can indicated things obviously wrong<BR/>- it can automate drawing of the structure, at least a good draft (stereochemistry excluded)<BR/><BR/>PS. the chemical structure is extracted from the experimental section, by converting the structure name to a structure. Not MDL molfile/etc involved.Egon Willighagenhttps://www.blogger.com/profile/07470952136305035540noreply@blogger.comtag:blogger.com,1999:blog-8114624464462183402.post-8905847302700745952008-01-07T07:17:00.000+01:002008-01-07T07:17:00.000+01:00A few more comments have been posted - according t...A few more comments have been posted - according to my opinion, the situation has been clarified, despite my questions havn't been answered explicitelyAnonymousnoreply@blogger.comtag:blogger.com,1999:blog-8114624464462183402.post-32455874459184778952008-01-06T19:51:00.000+01:002008-01-06T19:51:00.000+01:00One response has been posted by Egon Willighagen -...One response has been posted by Egon Willighagen - see URL aboveAnonymousnoreply@blogger.com