This project is funded by the European Union under the Seventh Framework Programme Grant Agreement n°313082
European Union


The two platforms – Cortext and SMS digital platforms – are complementary in their focus. SMS aims at taking advantage of the new resources provided by the web. The semantic challenge is to combine these data and tools to support construction and use of datasets, to be exploited by researchers. Cortext is an integrated tool dedicated to the cleaning and treatment of corpuses produced by researchers, whether structured or unstructured..

SMS is a platform – a set of related tools – that focuses on accessing, combining and analyzing heterogeneous data about scholarly communication and behavior. The main focus is on Web data. Additionally, the platform also supports the deployment of traditional databases and the combination with web data. SMS is thus focused on open data, but it also provides access to several of databases about scholarly communication – partly web based, partly not – which are proprietary and therefore not easily accessible for researchers.

For accessing SMS platform, click here.

The CorText platform is a unique computational facility that has been developed to treat large textual corpuses in the field of science and technology, with the objective of helping researchers to analyse knowledge dynamics in multiple situations (emergence of new breakthrough knowledge, new forms of knowledge structuration, knowledge dealing with a societal issues, disputes and public debates around new knowledge).

Four features make the Cortext platform unique. First it combines multiple scientific and technical competencies: natural language processing, information extraction, knowledge management, complex networks analysis, scientometrics, web design and computer science. Second, it plays a major role in integrating and articulating existing tools (focusing mostly on open source ones). Third it enables the semantic analysis of large textual corpuses (most developments being limited in size). Fourth it has made a major effort in designing interfaces and workflows.

It is a ‘service’ and works as follows: users, wherever they are, upload their raw database, use the tools of the platforms to ‘clean’ and ‘enrich’ it; they then perform multiple treatments to analyse the dataset and then mobilise visualisation tools that help interpreting the results.

For accessing CORTEXT Manager, click here.