day 7 commits

lunbixiaozi · lunbixiaozi · commit bd87aed4cdbe · 2019-03-29T21:21:23.000-04:00
diff --git a/19100303/lunbixiaozi/mymodule/main.py b/19100303/lunbixiaozi/mymodule/main.py
@@ -0,0 +1,38 @@
+import stats_word
+import re
+
+text = '''
+愚公移山
+太行，王屋二山的北面，住了一個九十歲的老翁，名叫愚公。二山佔地廣闊，擋住去路，使他和家人往來極為不便。
+一天，愚公召集家人說：「讓我們各盡其力，剷平二山，開條道路，直通豫州，你們認為怎樣？」
+大家都異口同聲贊成，只有他的妻子表示懷疑，並說：「你連開鑿一個小丘的力量都沒有，怎可能剷平太行、王屋二山呢？況且，鑿出的土石又丟到哪裏去呢？」
+大家都熱烈地說：「把土石丟進渤海裏。」
+於是愚公就和兒孫，一起開挖土，把土石搬運到渤海去。
+愚公的鄰居是個寡婦，有個兒子八歲也興致勃勃地走來幫忙。
+寒來暑往，他們要一年才能往返渤海一次。
+住在黃河河畔的智叟，看見他們這樣辛苦，取笑愚公說：「你不是很愚蠢嗎？你已一把年紀了，就是用盡你的氣力，也不能挖去山的一角呢？」
+愚公歎息道：「你有這樣的成見，是不會明白的。你比那寡婦的小兒子還不如呢！就算我死了，還有我的兒子，我的孫子，我的曾孫子，他們一直傳下去。而這二山是不會加大的，總有一天，我們會把它們剷平。」
+智叟聽了，無話可說：
+二山的守護神被愚公的堅毅精神嚇倒，便把此事奏知天帝。天帝佩服愚公的精神，就命兩位大力神揹走二山。
+
+How The Foolish Old Man Moved Mountains
+Yugong was a ninety-year-old man who lived at the north of two high mountains, Mount Taixing and Mount Wangwu.
+Stretching over a wide expanse of land, the mountains blocked yugong’s way making it inconvenient for him and his family to get around.
+One day yugong gathered his family together and said,”Let’s do our best to level these two mountains. We shall open a road that leads to Yuzhou. What do you think?”
+All but his wife agreed with him.
+“You don’t have the strength to cut even a small mound,” muttered his wife. “How on earth do you suppose you can level Mount Taixin and Mount Wanwu? Moreover, where will all the earth and rubble go?”
+“Dump them into the Sea of Bohai!” said everyone.
+So Yugong, his sons, and his grandsons started to break up rocks and remove the earth. They transported the earth and rubble to the Sea of Bohai.
+Now Yugong’s neighbour was a widow who had an only child eight years old. Evening the young boy offered his help eagerly.
+Summer went by and winter came. It took Yugong and his crew a full year to travel back and forth once.
+On the bank of the Yellow River dwelled an old man much respected for his wisdom. When he saw their back-breaking labour, he ridiculed Yugong saying,”Aren’t you foolish, my friend? You are very old now, and with whatever remains of your waning strength, you won’t be able to remove even a corner of the mountain.”
+Yugong uttered a sigh and said,”A biased person like you will never understand. You can’t even compare with the widow’s little boy!”
+“Even if I were dead, there will still be my children, my grandchildren, my great grandchildren, my great great grandchildren. They descendants will go on forever. But these mountains will not grow any taler. We shall level them one day!” he declared with confidence.
+The wise old man was totally silenced.
+When the guardian gods of the mountains saw how determined Yugong and his crew were, they were struck with fear and reported the incident to the Emperor of Heavens.
+Filled with admiration for Yugong, the Emperor of Heavens ordered two mighty gods to carry the mountains away.
+'''
+
+
+stats_word.stats_text(text)
+
diff --git a/19100303/lunbixiaozi/mymodule/stats_word.py b/19100303/lunbixiaozi/mymodule/stats_word.py
@@ -0,0 +1,101 @@
+#-*- coding: UTF-8 -*- 
+import collections
+import os
+
+#text = 
+'''
+The Zen of Python, by Tim Peters
+
+
+Beautiful is better than ugly.
+Explicit is better than implicit.
+Simple is better than complex.
+Complex is better than complicated.
+Flat is better than nested.
+Sparse is better than dense.
+Readability counts.
+Special cases aren't special enough to break the rules.
+Although practicality beats purity.
+Errors should never pass silently.
+Unless explicitly silenced.
+In the face of ambxiguity, refuse the temptation to guess.
+There should be one-- and preferably only one --obvious way to do it.
+Although that way may not be obvious at first unless you're Dutch.
+Now is better than never.
+Although never is often better than *right* now.
+If the implementation is hard to explain, it's a bad idea.
+If the implementation is easy to explain, it may be a good idea.
+Namespaces are one honking great idea -- let's do more of those!
+'''
+
+#text_cn = 
+'''
+
+来自管理员童鞋的回复：可以自己定义哈，主要是实现函数的功能
+完成时可以自己写一些测试的参数，检验自己的函数功能是否正确
+
+'''
+
+def stats_text_en (text): #sort English words by the frequency.
+
+    for i in range(len(text)):
+        if (text[i] >= u'\u0041' and text[i]<=u'\u005a') or (text[i] >= u'\u0061' and text[i]<=u'\u007a'):
+            break
+
+
+    text_en = text[i:]
+    text_en = text_en.replace('--', '')
+    text_en = text_en.replace('!', '')
+    text_en = text_en.replace('*', '')
+    text_en = text_en.replace('.', ' ')
+    text_en = text_en.replace(',', '')
+
+    # print("CN words frequency: ")
+    # print(text_en)
+
+    text_en = text_en.split()
+
+    counter_en = collections.Counter(text_en)
+    print("\n\nEN words frequency: ")
+    print(counter_en)
+
+    return counter_en
+
+
+
+
+def stats_text_cn (text): #sort Chinese words by the frequency.
+    text_cn = ''
+
+    for ch in text:
+        if u'\u4e00' <= ch <= u'\u9fff': #only fetch the Chinese characthers
+            text_cn = text_cn + ch
+
+
+    # text = text.replace('：', '')
+    # text = text.replace('，', '')
+    # text = text.replace('\n', '')
+    #text = text.replace('*', '')
+    #print ('first char:')
+    #print (text[0])
+
+    text_split = []
+
+    for i in range(len(text_cn)):
+        text_split.append(text_cn[i])
+
+    #text = text.split()
+
+    counter_cn = collections.Counter(text_split)
+    print("CN wrods frequency: ")
+    print(counter_cn)
+    return counter_cn
+
+#print(stats_text_cn(text_cn))
+
+
+def stats_text (text): #call the functions above
+    
+    stats_text_cn (text)
+    stats_text_en (text)
+