A data-driven approach to identifying development stages in diachronic corpus linguistics https://corpling.hypotheses.org/2551