For those here that don't know what bayesian is. Google's Gmail spam filter is a good example. If you start marking messages from you friends as "Spam" -it's very bad for the system. As good stuff in the future will get blocked.
Hence you need to start with "seed" data that is accurate and continue to build on a foundation of a priori knowledge.