GSoC 2018: SpamBrainz - Fighting spam with machine learning

Notebooks, references and revised timeline

Notebooks:

References

Papers/scientific articles

Blog posts

Repositories

Books

  • Ng, Andrew (unpublished): Machine Learning Yearning

GDPR-Related

Revised timeline

  • Weeks 0-3: Researched existing projects, state-of-the-art of spam detection and machine learning.
  • Week 1: Designed SpamBrainz’ project structure and started work on monitoring and management web interface.
  • Week 3: Researched potential GDPR issues regarding SpamBrainz.
  • Weeks 4-5: Received Spam data set, analyzed, compared and visualized spam/non-spam editor data.
  • Week 6-7: Build the API and the SpamBrainz backend which the detection modules can plug into and a predictable dummy module to test it.
  • Weeks 8-11: Create the machine learning module and tweak the hyperparameters to hopefully get it to a usable state.
  • Week 12: Documentation, designing a simple website explaining SpamBrainz to our end users.
  • Week 13: Buffer period.
6 Likes