C++【哈希表封装unordered_map/set】

文章目录

- （1）修改原哈希表
- （2）迭代器
- （3）最后一步
- （4）关于key是自定义类型的额外补充(面试题)
- （5）源代码

（1）修改原哈希表

和红黑树封装一样的逻辑，unordered_map/set公用一张哈希表，所以改的时候两个容器都要一块取考虑，因为需要通过泛型去设计，要有复用思想。

这里一部分就简单说一下，比如第二个参数是K就是K，为什么要传第一个模板参数，它要和map复用，因为map也需要用到key，第一个模板参数拿到K的类型，因为查找，删除等接口的参数是K，第二个模板参数决定了树的节点存什么，是K还是KV。
在这里插入图片描述

我们一步一步加并修改原来哈希表的内容，这里就不叫V，自己写的时候这叫T，把节点结构体里面的pair类型改成T，构造函数里面用data初始化_data。以及后面涉及到pair<K,V>&kv的全都改成T&data。unordered_map就通过第二个模板参数控制node里面的data，同理unordered_set也一样。

之前是kv现在要改为_data，把kv.first改为_data,但是这里的第二个_data并不是我们想要的，因为这里的data到底是什么并不知道，上一层加了一层泛型。这需要用仿函数解决Key。原来哈希表里面涉及到hash(cur->kv.first) 需要成hash(kot(cur->_data))，表示通过指针获取节点的_data数据，我们并不知道这是什么类型需要再用仿函数Key创建的对象kot调用()来获取具体的数据，最后通过hash()来获取键值所对应的存储位置。

（2）迭代器

在这里插入图片描述
如上图，对于++it怎么走，大致思路是这样的，如果node不等于空继续往下走，如果为空，就需要找下一个桶，
这里写迭代器的时候因为++需要用到哈希表，就需要前置声明，这里存在互相引用的问题，迭代器需要哈希表，哈希表需要迭代器，就要把哈希表定义在前面，其他的一样的老套路，只不过里面多了一个哈希表指针，构造的时候需要初始化，另外另一个构造函数一会实现，主要这里先把迭代器++实现出来，因为它的迭代器只支持单向，没有减减。
实现迭代器加加解析：
如果node的下一个指针不为空，就继续往下走更新_node,如果为空，可能走到尾部了，就需要找下一个桶，下一个不为空的桶在哪我们不知道，先算自己在哪，先把_data里面的key取出来，再用哈希的仿函数获得整数，模上表的大小找到映射位置，算出来hashi加加一下，就去找不为空的桶，如果hashi小于数据个数，就进如循环，再判断如果下一个位置不为空，就把表的第一个节点指针给_node，否则++hashi，就过走完了整个表，没有找到下一个位置，就用空取替带_node。最后返回this指针。如下图代码：

Self& operator++(){if (_node->_next != nullptr){_node = _node->_next;}else{Key kot;Hash hash;size_t hashi = hash(kot(_node->_data)) % _ht->_tables.size();++hashi;while (hashi < _ht->_tables.size()){if (_ht->_tables[hashi]){_node = _ht->_tables[hashi];break;}else{++hashi;}if (hashi == _ht->_tables.size()){_node = nullptr;}}}return *this;}

我们实现begin和end，begin：要返回第一个桶的第一个数据，这个begin不是直接构建的，需要遍历去找第一个不为空的桶，找到之后就构建迭代器，但是除了传找到的这个节点，还得传哈希表，我们直接可以用this指针，就是指向哈希表对象本身。
end：就直接拿空就构造就可以
同理const_begin和const_end同理

            iterator begin(){Node* cur = nullptr;for (size_t i = 0; i < _tables.size(); ++i){cur = _tables[i];if (cur){break;}}return iterator(cur, this);}

上面需要一些细节，迭代器需要访问_table的私有，就让我们这个哈希表这个类提供一个get.size和get.table,也可以写一个友元函数，因为是模板还得需要加上模板参数，如图：
在这里插入图片描述

我们继续在unordered_map增加迭代器参数，同理unordered_set也一样，在哈希表类里里面曾加一个普通迭代器和const迭代器，如图下图：
注意这里的unordered_set两个普通迭代器底层用的都是const迭代器，unordered_map普通迭代器用的普通迭代器，因为它需要修改它的Value，它的K不能修改是通过paie里面的constkey达到的，而unordered_set只有K，K是不是能修改的。

在这里插入图片描述
但是直接用会报错，因为unordered_set调用begin的时候，普通对象调用的普通begin，返回的是普通迭代器，但这里是const迭代器，这里就不具体演示了，和红黑树封装一样的逻辑。具体原因是我们迭代器里没有写对应的构造，需要一个支持普通到const迭代器的构造，我们在里面单独加一个普通迭代器Iterator，构造里它的参数前面加一个const，具体就不说了可以看红黑树封装。其实就是权限的缩小。这里还有一个细节，如果调用const_iterator会有问题，因为begin是const，返回时，迭代器里的this指针就是const的，这时候要要把迭代器构造的哈希表对象指针改成const的。因为迭代器里面不会修改这里的哈希表，需要取遍历它，传普通传const都可以接收，因为权限可以缩小。
在这里插入图片描述

（3）最后一步

在这里插入图片描述

我们在unordered_map里面提供一个[]，它要去访问第二个V值也就是pair的second，如果它没有就插入，如果有就返回key所在节点的迭代器，因为里面调用的是insert，这样就不仅能查找对应的value值，并有修改功能并可以插入新的键值对,insert返回值得个是个pair类型。而且并把有关的insert返回值，如果返回成功就返回新插入节点的构造的迭代器加一个true即make_pair(iterator(newnode, this), true)，同样unordered_set的insert也一样改为pair<iterator,bool>,里面是是一个迭代器和布尔值。
总之，insert返回值是个pair类型，find返回的是一个迭代器，erase返回bool值，unordered_set除了不提供[]，其他都一样。
另外，我们的key还可以支持自定义类型，但是需要要满足支持转换成取模的仿函数，和等于比较和等于比较，调用unordered_map/set需要显示传HashDate仿函数如图：
在这里插入图片描述
如下图我们进行的测试和结果：

（4）关于key是自定义类型的额外补充(面试题)

1、一个类型要做unordered哈希系列的key：要满足支持转换成取模的仿函数和等于比较，如上。
2、一个类型要做map系列的key：要满足支持<比较。
在STL源码库中，map默认的第三个参数为less<_Key>，就是将key按照从小到大的顺序排列，其内部实现很简单，就是直接比较两个key的类型返回。但是如果是自定义类型的时候，直接用默认的less比较的话会报错，因为编译器不知道怎么处理，即不知道如何比较他们的大小，它能应对内置类型比如int等，比如复合类型string。如何解决？2种方法：
第一种方法：继续使用第三个模板参数，因为第三个模板参数内部是直接比较key的类型大小，我们可以在自定义的类型里实现重载<运算符，让它知道如何比较这两个key，因为默认参数是less，所以只需要重载<运算符就可以了，如果参数是greater，那么就需要重载>运算符。

struct nza {nza(int n1 = 0, int n2 = 1):_n1(n1), _n2(n2) {}bool operator<(const nza& Data) const{return _n1+_n2  < Data._n1 + Data._n2;}int _n1;int _n2;
};int main() {map<nza, int> myMap;}

第二种方法就是：自己实现一个仿函数。

struct nza {nza(int n1 = 0, int n2 = 1):_n1(n1), _n2(n2) {}int _n1;int _n2;
};
template<class T>
struct RLcmp {bool operator()(const T& Data1, const T& Data2)const {return Data1._n1 + Data1._n2< Data1._n1 + Data2._n2;}
};int main() 
{map<nza, int, RLcmp<nza> > myMap;
}

（5）源代码

OpenHash.h

#pragma once
#include<utility>
#include<vector>
#include<string>
#include<iostream>
using namespace std;template<class K>
struct HashFunc
{size_t operator()(const K& key){return key;}};
template<>
struct HashFunc<string>
{size_t operator()(const string& str){size_t hash = 0;for (auto& e : str){hash += e;hash *= 131;}return hash;}};template<class T>struct HashNode{T _data;HashNode<T>* _next;HashNode(const T& data):_next(nullptr), _data(data){}};template<class K, class T, class Key, class Hash>class HashTable;template<class K, class T, class Ref, class Ptr, class Key, class Hash>struct _HashIterator{typedef  HashNode<T> Node;typedef HashTable<K, T, Key, Hash> HT;typedef _HashIterator<K, T, Ref, Ptr, Key, Hash> Self;typedef _HashIterator<K, T, T&, T*, Key, Hash> Iterator;Node* _node;const HT* _ht;_HashIterator(Node* node, const HT* ht):_node(node), _ht(ht){}_HashIterator(const Iterator& it):_node(it._node), _ht(it._ht){}Ptr operator->(){return &_node->_data;}Ref operator*(){return _node->_data;}bool operator!=(const Self& s){return _node != s._node;}bool operator==(const Self& s) const{return _node == s._node;}Self& operator++(){if (_node->_next != nullptr){_node = _node->_next;}else{Key kot;Hash hash;size_t hashi = hash(kot(_node->_data)) % _ht->_tables.size();++hashi;while (hashi < _ht->_tables.size()){if (_ht->_tables[hashi]){_node = _ht->_tables[hashi];break;}else{++hashi;}if (hashi == _ht->_tables.size()){_node = nullptr;}}}return *this;}};template<class K, class T, class Key, class Hash>class HashTable{template<class K, class T, class Ref, class Ptr, class Key, class Hash>friend struct _HashIterator;typedef HashNode<T> Node;public:typedef _HashIterator<K, T, T&, T*, Key, Hash> iterator;typedef _HashIterator<K, T, const T&, const T*, Key, Hash> const_iterator;iterator begin(){Node* cur = nullptr;for (size_t i = 0; i < _tables.size(); ++i){cur = _tables[i];if (cur){break;}}return iterator(cur, this);}iterator end(){return iterator(nullptr, this);}const_iterator begin()const{Node* cur = nullptr;for (size_t i = 0; i < _tables.size(); ++i){cur = _tables[i];if (cur){break;}}return const_iterator(cur, this);}const_iterator end()const{return const_iterator(nullptr, this);}~HashTable(){for (auto& cur : _tables){while (cur){Node* next = cur->_next;delete cur;cur = next;}cur = nullptr;}}pair<iterator, bool> Insert(const T& data){Key kot;iterator it = Find(kot(data));if (it != end()){return make_pair(it, false);}Hash hash;if (_n == _tables.size()){/*size_t newsize = _tables.size() == 0 ? 10 : _tables.size() * 2;*/size_t newsize = GetNextPrime(_tables.size());vector<Node*> newtables(newsize, nullptr);for (auto& cur : _tables){while (cur){Node* next = cur->_next;;size_t hashi = hash(kot(cur->_data)) % newtables.size();cur->_next = newtables[hashi];newtables[hashi] = cur;cur = next;}}_tables.swap(newtables);}size_t hashi = hash(kot(data)) % _tables.size();Node* newnode = new Node(data);newnode->_next = _tables[hashi];_tables[hashi] = newnode;++_n;return make_pair(iterator(newnode, this), true);}iterator Find(const K& key){Key kot;if (_tables.size() == 0)return end();Hash hash;size_t hashi = hash(key) % _tables.size();Node* cur = _tables[hashi];while (cur){if (kot(cur->_data) == key){return iterator(cur, this);}cur = cur->_next;}return end();}bool Erase(const K& key){Key kot;Hash hash;size_t hashi = hash(key) % _tables.size();Node* prev = nullptr;Node* cur = _tables[hashi];while (cur){if (kot(cur->_data) == key){if (prev == nullptr){_tables[hashi] = cur->_next;}else{prev->_next = cur->_next;}delete cur;return true;}else{prev = cur;cur = cur->_next;}}return false;}size_t GetNextPrime(size_t prime){static const int __stl_num_primes = 28;static const unsigned long __stl_prime_list[__stl_num_primes] ={53, 97, 193, 389, 769,1543, 3079, 6151, 12289, 24593,49157, 98317, 196613, 393241, 786433,1572869, 3145739, 6291469, 12582917, 25165843,50331653, 100663319, 201326611, 402653189, 805306457,1610612741, 3221225473, 4294967291};size_t i = 0;for (; i < __stl_num_primes; ++i){if (__stl_prime_list[i] > prime)return __stl_prime_list[i];}return __stl_prime_list[i];}private:vector<Node*> _tables;size_t _n = 0;};

unordered_Map.h

#pragma once
#include"OpenHash.h"
namespace nza
{template<class K, class V,class Hash=HashFunc<K>>class unordered_Map{public:struct MapKey{const K& operator()(const pair<K,V>& kv){return kv.first;}};public:typedef  typename HashTable<K, pair<const K,V>, MapKey,Hash>::iterator iterator;typedef  typename HashTable<K, pair<const K,V>, MapKey,Hash>::const_iterator const_iterator;iterator begin(){return _ht.begin();}iterator end(){return _ht.end();}const_iterator begin() const{return _ht.begin();}const_iterator end() const{return _ht.end();}V& operator[](const K& key){pair<iterator, bool> ret = insert(make_pair(key, V()));return ret.first->second;}pair<iterator, bool> insert(const pair<K, V>& kv){return _ht.Insert(kv);}iterator find(const K& key){return _ht.Find(key);}bool erase(const K& key){return _ht.Erase(key);}private:HashTable<K,pair<const K, V>, MapKey,Hash> _ht;};class Date{friend struct HashDate;public:Date(int year = 2000, int month = 12, int day = 23): _year(year), _month(month), _day(day){}bool operator<(const Date& d)const{return (_year < d._year) ||(_year == d._year && _month < d._month) ||(_year == d._year && _month == d._month && _day < d._day);}bool operator>(const Date& d)const{return (_year > d._year) ||(_year == d._year && _month > d._month) ||(_year == d._year && _month == d._month && _day > d._day);}bool operator==(const Date& d) const{return _year == d._year&& _month == d._month&& _day == d._day;}friend ostream& operator<<(ostream& _cout, const Date& d);private:int _year;int _month;int _day;};ostream& operator<<(ostream& _cout, const Date& d){_cout << d._year << "-" << d._month << "-" << d._day;return _cout;}struct HashDate{size_t operator()(const Date& d){size_t hash = 0;hash += d._year;hash *= 31;hash += d._month;hash *= 31;hash += d._day;hash *= 31;return hash;}};void TestM1(){unordered_Map<int, int> m;m.insert(make_pair(66, 66));m.insert(make_pair(77, 77));m.insert(make_pair(160, 160));unordered_Map<int, int>::iterator it = m.begin();while (it != m.end()){cout << it->first << ":" << it->second << endl;++it;}cout << endl;}void TestM2(){Date d1(2016, 3, 15);Date d2(2016, 3, 15);Date d3(2016, 3, 12);Date d4(2016, 3, 11);Date d5(2023, 3, 12);Date d6(2023, 3, 13);Date a[] = { d1, d2, d3, d4, d5, d6 };unordered_Map<Date, int, HashDate> cm;for (auto e : a){cm[e]++;}for (auto& kv : cm){cout << kv.first << ":" << kv.second << endl;}cout << endl;}
}

unordered_Set.h

#pragma once
#include"OpenHash.h"
namespace nza
{template<class K,class Hash=HashFunc<K>>class unordered_Set{public:struct SetKey{const K& operator()(const K& key){return key;}};public:typedef typename  HashTable< K, K, SetKey,Hash>::const_iterator iterator;typedef typename  HashTable< K, K, SetKey,Hash>::const_iterator const_iterator;iterator begin(){return _ht.begin();}iterator end(){return _ht.end();}const_iterator begin() const{return _ht.begin();}const_iterator end() const{return _ht.end();}pair<iterator,bool> insert(const K& key){return _ht.Insert(key);}iterator find(const K& key){return _ht.Find(key);}bool erase(const K& key){return _ht.Erase(key);}private:HashTable<K, K, SetKey,Hash> _ht;};void TestS1(){int a[] = { 1, 6, 4, 66, 160, 32, 44 };unordered_Set<int> s;for (auto e : a){s.insert(e);}s.insert(35);s.insert(117);unordered_Set<int>::iterator it = s.begin();while (it != s.end()){cout << *it << " ";++it;}cout << endl;for (auto e : s){cout << e << " ";}cout << endl;}
}

test.cpp


#include"OpenHash.h"
#include"unordered_Set.h"
#include"unordered_Map.h"int main()
{nza::TestM1();nza::TestM2();nza::TestS1();}