One of the problems searching in these books is that there is no standardized spelling in early modern literature.
There is a project called DREaM, at McGill to standardize for "distance reading" (macro analysis).[1] It uses a program called VARD (a text preprocessor trained to correct spelling).[2]
Strangely, this application is licensed with the creative commons. I think this means that it is closed source. Does anyone know of any open source alternatives?
It cannot handle such an immense amount of data,[3]
There are lots of different CC licenses. None of them are closed-source, although there are some which don't allow modification (No-Derivs / ND). Others disallow copying for commercial use (NonCommercial / NC), and a couple allow almost anything (Attribution / BY, No Rights Reserved / CC0). https://creativecommons.org/choose/
> There are lots of different CC licenses. None of them are closed-source, although there are a few which don't allow modification.
Licenses that don't allow modifications are closed-source. At least, they are inconsistent with the Open Source Definition (specifically, with criteria #3: "The license must allow modifications and derived works, and must allow them to be distributed under the same terms as the license of the original software.")
I think that the distribution of VARD is allowed (if non-commerical), but editing the source code isn't. I think this because I cannot find the source for VARD.
That's why I think it's strange it is licensed CC.
I use it for the speed, despite its lack of features & ugliness. A lot of my devices are getting old chug when loading up new Gmail. Speed is why I use the HTML version.
The message is that our personalities are encapsulated by objects and media. These items of evident importance are of course legitimated by the benevolence of facebook's curation. Facebook is not a social experiment, it's a marketing tool.
Anyway, the accuracy of the study depends on a self-report.
>So we've been developing English for a very long time. If "little used verbs become regularized over time, but new verbs formed [are] never formed as irregular verbs", then why do we have still have irregular verbs at all? Why wouldn't they have been wiped out thousands of years ago?
we haven't been developing English for thousands of years. Who knows what will happen then! We'll be speaking in python no doubt.