Posts in the 'Python' category (page 4)

List: Python (125)

무회 Blog

Python, WebSocket

Python 2020. 9. 1. 08:35

Python, RESTful connection

Python 2020. 9. 1. 08:31

Korean stopwords

Python 2020. 9. 1. 08:27

Preprocessing the bAbI dataset, QnA

In [1]:
# pip install customized_konlpy

In [2]:
from ckonlpy.tag import Twitter
twitter = Twitter()
twitter.morphs('은경이는 사무실로 갔습니다.')

C:\anacondas\lib\site-packages\konlpy\tag\_okt.py:16: UserWarning: "Twitter" has changed to "Okt" since KoNLPy v0.4.5.
  warn('"Twitter" has changed to "Okt" since KoNLPy v0.4.5.')

Out[2]: ['은', '경이', '는', '사무실', '..

Python 2020. 8. 31. 17:56

QnA module classification and testing

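001.

from libs import *

df['cut_content'] = df['content'].apply(lambda x: " ".join(w for w in word_tokenize(str(x))))

# Here we use the parameter ngram_range=(1, 2).
# Besides extracting every individual word in a review, it also pairs each
# word with its neighbor to form a "word pair", for example:
# word1, word2, word3, word4, (word1, word2), (word2, word3), (word3, word4).
# This enlarges the feature set, and a richer feature set is what makes it
# possible to improve text classification accuracy.
# The parameter norm='l2' is a normalization step: it scales each document
# vector to unit Euclidean length, which keeps the values in a bounded range
# (for TF-IDF weights, between 0 and 1).
tfidf = TfidfVectorizer(norm='l2', ngram_range=(1, 2))
cut_contents =..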

Python 2020. 8. 28. 17:07
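
The `ngram_range=(1, 2)` and `norm='l2'` settings mentioned in the excerpt above can be illustrated with a small pure-Python sketch. This is not scikit-learn's implementation and it skips the TF-IDF weighting entirely. The helper names `extract_ngrams` and `l2_normalize` are hypothetical, and raw counts stand in for the real weights. It only shows which features a (1, 2) range would produce and what scaling a vector to unit length means.

```python
import math
from collections import Counter


def extract_ngrams(tokens, n_min=1, n_max=2):
    """Return every n-gram of length n_min..n_max, shorter n-grams first."""
    ngrams = []
    for n in range(n_min, n_max + 1):
        # Slide a window of size n across the token list.
        for i in range(len(tokens) - n + 1):
            ngrams.append(" ".join(tokens[i:i + n]))
    return ngrams


def l2_normalize(weights):
    """Scale a {feature: weight} mapping to unit Euclidean length."""
    norm = math.sqrt(sum(v * v for v in weights.values()))
    if norm == 0:
        return dict(weights)
    return {k: v / norm for k, v in weights.items()}


# Hypothetical tokenized document, mirroring "word1 ... word4" in the excerpt.
tokens = ["w1", "w2", "w3", "w4"]

# Unigrams plus adjacent word pairs.
features = extract_ngrams(tokens, 1, 2)

# Raw counts are used here in place of TF-IDF weights (an assumption made
# to keep the sketch short); after normalization the squared values sum to 1.
weights = l2_normalize(Counter(features))

print(features)
print(weights)
```

With non-negative input weights, every normalized value ends up between 0 and 1, which is the "bounded range" effect the excerpt refers to.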
