Notebooks, references and revised timeline
Notebooks:
References
Papers/scientific articles
- Bhowmick, Hazarika (2016) Machine Learning for E-mail Spam Filtering: Review,
Techniques and Trends
- Caruana, Li (2012) A survey of emerging approaches to spam filtering
- Arif, Li et al. (2017) Sentiment analysis and spam detection in short informal text using learning classifier systems
- Giyanani, Desai (2014) Spam Detection using Natural Language Processing
- Androutsopoulos, Paliouras et al. (2000) Learning to Filter Spam E-Mail: A Comparison of a Naive
Bayesian and a Memory-Based Approach
- Santosh, Maity, Mukherjee (2017) ENWalk: Learning Network Features for Spam
Detection in Twitter
- Miikkulainen, Liang et al. (2017) Evolving Deep Neural Networks
- Ren, Triantafillou et al. (2018) Meta-Learning for Semi-Supervised Few-Shot Classification
- Chen, Shah (2018) Explaining the Success of Nearest Neighbor Methods in Prediction
Blog posts
Repositories
Books
- Ng, Andrew (unpublished): Machine Learning Yearning
GDPR-Related
Revised timeline
- Weeks 0-3: Researched existing projects, state-of-the-art of spam detection and machine learning.
- Week 1: Designed SpamBrainz’ project structure and started work on monitoring and management web interface.
- Week 3: Researched potential GDPR issues regarding SpamBrainz.
- Weeks 4-5: Received Spam data set, analyzed, compared and visualized spam/non-spam editor data.
- Week 6-7: Build the API and the SpamBrainz backend which the detection modules can plug into and a predictable dummy module to test it.
- Weeks 8-11: Create the machine learning module and tweak the hyperparameters to hopefully get it to a usable state.
- Week 12: Documentation, designing a simple website explaining SpamBrainz to our end users.
- Week 13: Buffer period.
6 Likes