According to one of my problem, I find that other also have same situation, here is the quotation:
“…I have a page in Russian language. I have the charset and language in the META set to “Windows-1251″ and respectively “RU”. When running the Google Spider-simulator, it ignores all the Cyrillic characters in the page content, while those in TITLE, DESCRIPTION and KEYWORDS are crawled ok. In the page content only the latin characters are crawled. I’ve checked whether the Spider crawls all the page content to the end and this is actually the case…”
When I switch to charset UTF-8, everything go fine.
In connection with the creation of a new website and because we want maximum results, we do a study on how to properly arrange the words from SEO point of view according to Google’s Matt Cutts SEO expert. Advice he gives can be found here:
Yes, this is understandable enough, but we faced to the following problem – cyrillic words. For example, we have http://www.avalonbg.com/service/software and in Cyrillic language this will sound like:
because we know that google convert cyrillic letters into latin (if you search for “имоти” или “imoti” will come up same result), the result will be www.avalonbg.com/serviz-i-uslugi/instalaciq-na-softuer or www.avalonbg.com/serviz-i-uslugi/instalacia-na-softwareOn top of that, some users write words in several ways: instalacia,instalaciya, instalaciq or just use corect English word installation.
So, because of above tricky situation we bet on a secure method. We just reorder our keywords in English on the basis of our menu level navigation, keeping the number of keywords in a URL to a minimum of 4 to 5. After 35 days we go from PR0 to PR3. Оnother thing that we want to say … When we used the conversion of Latin words method in another futile our website – result was a disaster.