
Algorithm Study And So On

poj 1200 Crazy Search: string hashing

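The problem asks how many distinct substrings of length N occur in a given string. The input
also gives NC, the number of distinct characters in the text.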
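Brute force, storing every substring in a set, is sure to time out. There is a solution based on
string hashing. It has a bug in theory (two different substrings can collide), but it passes this problem.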
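
For cross-checking on small inputs, the brute-force idea mentioned above can be sketched as follows. This is only a minimal sketch, not the code submitted for the problem, and the function name countDistinctBrute is hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <set>
#include <string>

// Reference (slow) solution: put every length-n substring into a set.
// Each insertion copies and compares n characters, so this is far too
// slow for the full input size but handy for testing a hashed solution.
std::size_t countDistinctBrute(const std::string& text, std::size_t n)
{
    assert(n > 0);
    std::set<std::string> seen;
    for (std::size_t i = 0; i + n <= text.size(); ++i)
    {
        seen.insert(text.substr(i, n));
    }
    return seen.size();
}
```

For the text "daababac" and n = 3 the distinct substrings are daa, aab, aba, bab and bac, so the function returns 5.
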
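Using the given NC, treat each substring of length N as a number in base NC, compute its value,
hash that value, and count how many distinct hash slots end up occupied.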
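
The base-NC encoding can be sketched as below. The names buildDigitMap and windowValue are hypothetical, digits are assigned from 0 in order of first appearance (the listing in this post starts from 1), and no modulus is taken, so the sketch is only valid while NC^N fits in 64 bits.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Give each distinct byte a digit 0, 1, 2, ... in order of first appearance.
// Bytes that never occur keep the value -1.
std::vector<int> buildDigitMap(const std::string& text)
{
    std::vector<int> digit(256, -1);
    int next = 0;
    for (unsigned char c : text)
    {
        if (digit[c] < 0)
        {
            digit[c] = next++;
        }
    }
    return digit;
}

// Value of text[start, start + n) read as a number in the given base.
// No modulus is applied here (assumption: base^n fits in 64 bits).
unsigned long long windowValue(const std::string& text, std::size_t start,
                               std::size_t n, const std::vector<int>& digit,
                               unsigned base)
{
    assert(start + n <= text.size());
    unsigned long long value = 0;
    for (std::size_t i = 0; i < n; ++i)
    {
        unsigned char c = static_cast<unsigned char>(text[start + i]);
        value = value * base + static_cast<unsigned long long>(digit[c]);
    }
    return value;
}
```

With the text "daababac" the map is d=0, a=1, b=2, c=3, so in base 4 the window "daa" has value 0*16 + 1*4 + 1 = 5 and "aab" has value 1*16 + 1*4 + 2 = 22. Two windows of the same length get the same value only if they are the same string.
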
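This algorithm is in fact similar to the Karp-Rabin string matching algorithm. Karp-Rabin adds two
refinements, though. When it evaluates a string as a base-D number, it reduces the value modulo a
prime to prevent overflow. It also does not recompute each substring's value from scratch; it derives
the next substring's value directly from the current one. How? It is simple: remove the highest digit
from the current value, multiply by D (a left shift by one place, but in base D, so the << operator
cannot be used directly), then add the new lowest digit.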
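
One step of that recurrence, with every intermediate value reduced modulo q, might look like the sketch below. The function name rollHash and its parameter names are hypothetical, and the sketch assumes q is at most 2^32 so that no product overflows 64 bits.

```cpp
#include <cassert>
#include <cstdint>

// One rolling step for a window of n digits in base `base`, modulo q.
//   h        current window value mod q
//   outDigit digit leaving the window (the highest place)
//   inDigit  digit entering the window (the lowest place)
//   highPow  base^(n-1) mod q, the weight of the highest place
std::uint64_t rollHash(std::uint64_t h, std::uint64_t outDigit,
                       std::uint64_t inDigit, std::uint64_t base,
                       std::uint64_t highPow, std::uint64_t q)
{
    assert(q > 0 && q <= (1ULL << 32) && h < q);
    // Contribution of the leading digit, reduced mod q.
    std::uint64_t removed = (highPow % q) * (outDigit % q) % q;
    // Adding q before subtracting keeps the result non-negative.
    std::uint64_t rest = (h + q - removed) % q;
    // Shift one place in base `base`, then append the new lowest digit.
    return (rest * (base % q) + inDigit % q) % q;
}
```

In base 10 with n = 3, rolling 123 to 234 means removing 1 * 100, multiplying by 10, and adding 4. With a modulus larger than every value involved the call returns 234 itself; with q = 7 it returns 234 mod 7 = 3.
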
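The heart of Karp-Rabin is probably the design of a reasonable hash. With a modular hash function,
for example, the hash table has to be large enough; otherwise there are too many collisions and
performance suffers. In the code here only the reduced value is stored, so substrings that collide
are also counted as one. In this problem, a table that is too small will not get AC.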
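
The effect of the table size can be tried out with a small sketch like the one below. It is not the submitted solution: countDistinctHashed is a hypothetical name, it marks slots in a bit vector of q entries instead of probing, and it assumes q is at most 2^32.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Count distinct values (mod q) of all length-n windows read in `base`.
// Windows whose values collide mod q are counted only once.
std::size_t countDistinctHashed(const std::string& text, std::size_t n,
                                std::uint64_t base, std::uint64_t q)
{
    assert(n > 0 && q > 0 && q <= (1ULL << 32));
    if (text.size() < n) return 0;

    // Digits 0, 1, 2, ... in order of first appearance.
    std::vector<std::uint64_t> digit(256, 0);
    std::vector<bool> known(256, false);
    std::uint64_t next = 0;
    for (unsigned char c : text)
        if (!known[c]) { known[c] = true; digit[c] = next++; }

    const std::uint64_t b = base % q;
    std::uint64_t highPow = 1 % q;            // base^(n-1) mod q
    for (std::size_t i = 1; i < n; ++i) highPow = highPow * b % q;

    std::uint64_t h = 0;                      // value of the first window
    for (std::size_t i = 0; i < n; ++i)
        h = (h * b + digit[static_cast<unsigned char>(text[i])]) % q;

    std::vector<bool> seen(q, false);
    std::size_t count = 0;
    for (std::size_t i = n; ; ++i)
    {
        if (!seen[h]) { seen[h] = true; ++count; }
        if (i >= text.size()) break;
        // Roll: drop text[i - n], append text[i].
        std::uint64_t out = digit[static_cast<unsigned char>(text[i - n])];
        std::uint64_t in = digit[static_cast<unsigned char>(text[i])];
        std::uint64_t removed = highPow * out % q;
        h = ((h + q - removed) % q * b + in) % q;
    }
    return count;
}
```

On "daababac" with n = 3 and base 4, a table of 13747347 slots gives the correct count of 5, while a table of 2 slots can never report more than 2.
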

The code is as follows:

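#include <stdio.h>
#include <string.h>

// Hash table size; every key is also reduced modulo this value.
const int MAX = 13747347;
int nHash[MAX];
char szStr[17000001];
int nN, nNC;
int nW[256];    // digit (1..NC) assigned to each byte value, 0 = not seen yet

// Open addressing with linear probing; nKey must already be in [0, MAX).
void Insert(int nKey)
{
    int nPos = nKey;
    while (nHash[nPos] != -1 && nHash[nPos] != nKey)
    {
        nPos = (nPos + 1) % MAX;
    }
    nHash[nPos] = nKey;
}

bool Find(int nKey)
{
    int nPos = nKey;
    while (nHash[nPos] != -1 && nHash[nPos] != nKey)
    {
        nPos = (nPos + 1) % MAX;
    }
    return nHash[nPos] != -1;
}

int main()
{
    // The field width keeps scanf from writing past the end of szStr.
    while (scanf("%d%d%17000000s", &nN, &nNC, szStr) == 3)
    {
        memset(nW, 0, sizeof(nW));
        memset(nHash, -1, sizeof(nHash));
        int nNum = 0;
        int nSize = 0;
        // Number the distinct characters 1, 2, ... in order of first appearance.
        for (char* pszStr = szStr; *pszStr; ++pszStr)
        {
            unsigned char ch = (unsigned char)*pszStr;  // avoid negative indices
            if (!nW[ch])
            {
                nW[ch] = ++nNum;
            }
            ++nSize;
        }

        // A text shorter than N has no substring of length N.
        if (nSize < nN)
        {
            printf("0\n");
            continue;
        }

        // long long keeps the products below from overflowing.
        long long nKey = 0;
        int nAns = 0;
        long long nPowN = 1;    // nNC^(nN-1) mod MAX, weight of the highest digit
        for (int j = 0; j < nN; ++j)
        {
            nKey = (nKey * nNC + nW[(unsigned char)szStr[j]]) % MAX;
            if (j > 0)
            {
                nPowN = nPowN * nNC % MAX;
            }
        }
        if (!Find((int)nKey))
        {
            Insert((int)nKey);
            nAns++;
        }

        for (int i = nN; i < nSize; ++i)
        {
            // Drop the highest digit, shift one place in base NC, add the new digit.
            long long nOut = nPowN * nW[(unsigned char)szStr[i - nN]] % MAX;
            nKey = ((nKey - nOut + MAX) % MAX * nNC
                    + nW[(unsigned char)szStr[i]]) % MAX;

            if (!Find((int)nKey))
            {
                Insert((int)nKey);
                nAns++;
            }
        }

        printf("%d\n", nAns);
    }

    return 0;
}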

posted on 2012-09-27 22:07 by yx. Category: Strings


