“A wise man proportions his belief to the evidence.” – David Hume, Scottish philosopher
As crypto researchers, we believe the community cannot go mainstream without having enough controls to prevent common investors from getting duped by scammy ICOs and coins. Our motive is to detect scams and warn investors, using machine learning and crowdsourcing techniques.
Comparing cryptos are not easy, given we don’t have enough data points to classify a crypto, or an ICO as scam or not. We follow a similarity based approach where we quantify how good a crypto is with regards to a reference. This way apart from scoring using weights, we can get a similarity score to the reference in a vector space model. A standard way of doing this is by using cosine similarity. Our approach gives two outputs, overall score, and similarity score with regards to reference. Both are between 0-1, where 1 indicates the maximum score. Our assumption is these two values should give enough evidence on how good a crypto is. We collectively chose Ethereum to be the reference point, as it has everything that we are looking forward to in terms of dimensions(utility, team, technology etc.).
We are going to use the below listed features as dimensions to project crytpo into a vector space to compare similarity
T = Team
U = Utility
P = Partnership
W = Technology – Whitepaper
I = Technology – Implementation(Code check-ins etc.)
C = Community
S = Price Data TimeSeries – Pump and Dump, and other manipulation
Max Weights – We’ll re-tune them based on feedback and once we have enough training data
[1,1,1,1,1,1,1] => This would be community generated by giving enough evidences on our subreddit /r/shamcoin/. We will use the evidences to come up with the final vector.
Final Score = Tw1 + Uw2 + Pw3 + Ww4 + Iw5 + Cw6 + Sw7
Final score would be 1, since all the feature scores are 1.
Similarity Score would be 1 as well, because the vectors are exactly same, and cos(0) is 1.
Next Step – Rank every cryptos and ICOs with this model!