Research data for Spam

Welcome

Research data for spam (rdata4spam) is a corpus repository designed to protect the authors of the compiled texts, to ensure that all techniques can be applied to the data, and to let users preprocess the data according to their needs.

rdata4spam is powered by STRep software (Spam Text Repository, https://github.com/sing-group/strep), BDP4J (Big Data Preprocessing for Java, https://github.com/sing-group/bdp4j) and NLPA (Natural Language Pre-Processing Architecture, https://github.com/sing-group/nlpa).