首页猿问重写System.Object.G...

重写System.Object.GetHashCode的最佳算法是什么？

.NET C++ C#

米琪卡哇伊 2019-05-22 14:40:36

重写System.Object.GetHashCode的最佳算法是什么？在.NET System.Object.GetHashCode方法中，很多地方都使用.NET 方法。特别是在快速查找集合中的项目或确定相等性时。是否有关于如何GetHashCode为我的自定义类实现覆盖的标准算法/最佳实践，因此我不会降低性能？

查看完整描述

4 回答

白板的微信

TA贡献1883条经验获得超3个赞

我通常会使用Josh Bloch 精彩的 Effective Java中的实现。它很快并且创建了一个非常好的哈希，不太可能导致冲突。选择两个不同的素数，例如17和23，并执行：

public override int GetHashCode(){
    unchecked // Overflow is fine, just wrap
    {
        int hash = 17;
        // Suitable nullity checks etc, of course :)
        hash = hash * 23 + field1.GetHashCode();
        hash = hash * 23 + field2.GetHashCode();
        hash = hash * 23 + field3.GetHashCode();
        return hash;
    }}

正如评论中所指出的，你可能会发现最好选择一个大素数乘以。显然486187739是好的...虽然我看到的大多数例子都是小数字倾向于使用素数，但至少有类似的算法，其中经常使用非素数。例如，在后来的非FNV例子中，我使用的数字显然效果很好 - 但初始值不是素数。（乘法常数虽然是素数。我不知道它有多重要。）

这比XOR出于哈希码的常见做法更好，主要有两个原因。假设我们有一个包含两个int字段的类型：

XorHash(x, x) == XorHash(y, y) == 0 for all x, yXorHash(x, y) == XorHash(y, x) for all x, y

顺便说一下，早期的算法是C＃编译器当前用于匿名类型的算法。

这个页面提供了很多选择。我认为在大多数情况下，上述情况“足够好”并且非常容易记住并且正确。所述FNV替代方案是同样简单，但使用不同的常数和XOR代替ADD作为组合操作。它看起来的东西像下面的代码，但正常的FNV算法对每个字节进行操作，所以这将需要修改来执行的，而不是每32位的哈希值每字节一个迭代。FNV也是为可变长度的数据而设计的，而我们在这里使用它的方式总是针对相同数量的字段值。对这个答案的评论表明，这里的代码实际上并不像上面的添加方法那样（在测试的示例中）。

// Note: Not quite FNV!public override int GetHashCode(){
    unchecked // Overflow is fine, just wrap
    {
        int hash = (int) 2166136261;
        // Suitable nullity checks etc, of course :)
        hash = (hash * 16777619) ^ field1.GetHashCode();
        hash = (hash * 16777619) ^ field2.GetHashCode();
        hash = (hash * 16777619) ^ field3.GetHashCode();
        return hash;
    }}

请注意，有一点需要注意的是，理想情况下，在将其添加到依赖于哈希代码的集合之后，应该防止对等式敏感（因此对哈希码敏感）状态的更改。

根据文件：

您可以为不可变引用类型重写GetHashCode。通常，对于可变引用类型，只有在以下情况下才应覆盖GetHashCode：
您可以从不可变的字段计算哈希码; 要么
您可以确保在对象包含在依赖于其哈希代码的集合中时，可变对象的哈希码不会更改。

反对回复 2019-05-22

饮歌长啸

TA贡献1951条经验获得超3个赞

这是我的哈希码助手。
它的优点是它使用泛型类型参数，因此不会导致装箱：

public static class HashHelper{
    public static int GetHashCode<T1, T2>(T1 arg1, T2 arg2)
    {
         unchecked
         {
             return 31 * arg1.GetHashCode() + arg2.GetHashCode();
         }
    }

    public static int GetHashCode<T1, T2, T3>(T1 arg1, T2 arg2, T3 arg3)
    {
        unchecked
        {
            int hash = arg1.GetHashCode();
            hash = 31 * hash + arg2.GetHashCode();
            return 31 * hash + arg3.GetHashCode();
        }
    }

    public static int GetHashCode<T1, T2, T3, T4>(T1 arg1, T2 arg2, T3 arg3, 
        T4 arg4)
    {
        unchecked
        {
            int hash = arg1.GetHashCode();
            hash = 31 * hash + arg2.GetHashCode();
            hash = 31 * hash + arg3.GetHashCode();
            return 31 * hash + arg4.GetHashCode();
        }
    }

    public static int GetHashCode<T>(T[] list)
    {
        unchecked
        {
            int hash = 0;
            foreach (var item in list)
            {
                hash = 31 * hash + item.GetHashCode();
            }
            return hash;
        }
    }

    public static int GetHashCode<T>(IEnumerable<T> list)
    {
        unchecked
        {
            int hash = 0;
            foreach (var item in list)
            {
                hash = 31 * hash + item.GetHashCode();
            }
            return hash;
        }
    }

    /// <summary>
    /// Gets a hashcode for a collection for that the order of items 
    /// does not matter.
    /// So {1, 2, 3} and {3, 2, 1} will get same hash code.
    /// </summary>
    public static int GetHashCodeForOrderNoMatterCollection<T>(
        IEnumerable<T> list)
    {
        unchecked
        {
            int hash = 0;
            int count = 0;
            foreach (var item in list)
            {
                hash += item.GetHashCode();
                count++;
            }
            return 31 * hash + count.GetHashCode();
        }
    }

    /// <summary>
    /// Alternative way to get a hashcode is to use a fluent 
    /// interface like this:<br />
    /// return 0.CombineHashCode(field1).CombineHashCode(field2).
    ///     CombineHashCode(field3);
    /// </summary>
    public static int CombineHashCode<T>(this int hashCode, T arg)
    {
        unchecked
        {
            return 31 * hashCode + arg.GetHashCode();   
        }
    }

它还有扩展方法来提供流畅的界面，所以你可以像这样使用它：

public override int GetHashCode(){
    return HashHelper.GetHashCode(Manufacturer, PartN, Quantity);}

或者像这样：

public override int GetHashCode(){
    return 0.CombineHashCode(Manufacturer)
        .CombineHashCode(PartN)
        .CombineHashCode(Quantity);}

反对回复 2019-05-22

4 回答
0 关注
844 浏览

关注

添加回答

0/150

提交

取消

热搜

最近搜索清空

重写System.Object.GetHashCode的最佳算法是什么？

重写System.Object.GetHashCode的最佳算法是什么？

4 回答

添加回答