词云图wordcloud

1.安装第三方库

$ji e ba 库、 ma tpl o tl ib 、 w or d c l o u d 库$
民图灵机

2.过程

1.使用 $ji e ba$ 库对数据进行分词整理，转为 $t x t$ 文件，转变为以空格分隔的词语字符串 $s t r in g$ 。
2.调用 $w or d co l u d$ 等函数绘制。

3.wordcloud的常用方法函数参数

参数：

1. $font\_path : string$ : 字体路径，格式：字体路径+后缀名，
如 $\ w i n d o w s \ F o n t \ w h i t e . t t f C:\backslash windows\backslash Font \backslash white.ttf$
2. $w i d t h : in t (d e f a u lt = 400)$ : 输出的画布宽度
3. $h e i g h t : in t (d e f a u lt = 200)$ ：输出的画布高度
4. $prefer\_horizontal : float(default=0.90)$ : 词语水平方向排版出现的频率，垂直方向做差。
5. $sc a l e : f l o a t (d e f a u lt = 1)$ : 按照比例放大画布，如设置 $sc a l e = 2$ ，则长宽都是原来的 $2$ 倍。
6. $min\_font\_size : int(default=4)$ : 显示的最小字体的大小。
7. $max\_words : int(default=200)$ : 显示的词的最大个数。
8. $background\_color : (default='black')$ ：背景颜色。
9. $max\_font\_size : int(default=None)$ : 显示的最大字体的大小。
10. $ma s k : n p . a rr a y 、 N o n e$ ：参数为空，默认词云形状为长方形。

函数：

1. $generate\_from\_text(text)$ ：根据文本生成词云。
2. $g e n er a t e (t e x t)$ : 根据文本生成词云。
3. $generate\_from\_frequencies(frequencies[, ...])$ : 根据词频生成词云。
4. $to\_file(filename)$ : 输出到文件。

def generate(self, text):"""Generate wordcloud from text.The input "text" is expected to be a natural text. If you pass a sortedlist of words, words will appear in your output twice. To remove thisduplication, set ``collocations=False``.Alias to generate_from_text.Calls process_text and generate_from_frequencies.Returns-------self"""return self.generate_from_text(text)def generate_from_text(self, text):"""Generate wordcloud from text.The input "text" is expected to be a natural text. If you pass a sortedlist of words, words will appear in your output twice. To remove thisduplication, set ``collocations=False``.Calls process_text and generate_from_frequencies...versionchanged:: 1.2.2Argument of generate_from_frequencies() is not return ofprocess_text() any more.Returns-------self"""words = self.process_text(text)self.generate_from_frequencies(words)return self