Alexa offers a limited coverage of websites, so we need to complement it with other sources that will help us improve and extend the coverage.
For that, we should use a combination of other 4 sources as explained in this paper: http://arxiv.org/pdf/1411.5281v1.pdf
These are: Cyren, Google Ad Words, McAfee and WebPulse.
Pointers for those sources are mentioned in the paper. If we have any doubts we may write or call paper authors. I know them