Seongeun Kwon kse0202

## concat_files.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / concat_files.md
            
            
              Created
              June 28, 2021 08:52
            
              
                폴더 안에 있는 파일(excel, csv)들 한개로 합친 dataframe 만들기
              
          
    폴더안에 있는 파일들 하나로 만들기

import pandas as pd
import os
import glob


all_data = []

  
## def_hddd_to_wgs84.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / def_hddd_to_wgs84.md
            
            
              Created
              June 28, 2021 07:26
            
              
                hddd 형식으로 표현된 좌표를 wgs84 형식으로 변환하는 함수 정의
              
          
    hddd_to_wgs84(table,x,y) 함수 정의 -> 이걸 이용해서 변환한다

hddd.ddddd“ 는 도 단위를 더욱 세분화 하기 위해 도 단위의 소수 자리까지 표현한 형태
# table = 좌표를 계산해야 할 테이블 명
# x, y  = table안의 x, y 컬럼 명, 'x','y'로 넣어준다

def hddd_to_wgs84(table,x,y):

  
## postgresql_1.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / postgresql_1.md
            
            
              Created
              June 28, 2021 07:16
            
              
                postgresql의 geometry 함수(st_point, st_makeline, st_setstrid), lead함수(python으로 하는것도) 
              
          
    ST_SetSRID, geometry 컬럼에 SRID 지정

ST_SetSRID(st_point(start_x ::double precision, start_y::double precision), 4326)

ST_Transdorm, SRID 변경

query_set_srid = "ALTER TABLE table_nm \
                    ALTER COLUMN geom TYPE geometry(point, 5179)\

  
## python_postgres_table_download_update.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              1 star
            
          
                kse0202
                / python_postgres_table_download_update.md
            
            
              Last active
              July 11, 2023 04:43
            
              
                Jupyter Notebook에서 python으로 1. postgresql DB에 접속 2. DB내 테이블 DataFrame으로 가져오기 3. DataFrame을 기존 DB내 테이블에 업데이트
              
          
    #-*-coding:utf-8
import psycopg2
import pandas as pd
import numpy as np
import csv

import sql
from sqlalchemy import create_engine

  
## python_postgres_db_connect.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / python_postgres_db_connect.md
            
            
              Last active
              June 28, 2021 05:33
            
              
                Jupyter Notebook에서 python으로 PostgreSQL DB 접속하는 방법 (psycopg2 이용)
              
          
    # 패키지 불러오기

#-*-coding:utf-8
import psycopg2
import pandas as pd
import csv

import sql
from sqlalchemy import create_engine

  
## remove_emoji.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / remove_emoji.md
            
            
              Last active
              June 9, 2021 12:52
            
              
                python re패키지로 텍스트 전처리
              
          
    python re패키지를 이용하여
한글 자음, 한글 모음, 특수문자, 이모티콘 삭제하기
import re


def get_clean_text(df):

  
## youtube_crawler_20210512.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / youtube_crawler_20210512.md
            
            
              Last active
              June 9, 2021 12:31
            
              
                유튜브 댓글 크롤링 20210512 
              
          
    selenium을 이용하여 유튜브 댓글 크롤링하기

키워드 입력 후 검색되는 영상의 제목, url, 정보를 수집하여 데이터프레임 및 csv로 저장
저장된 url을 조회하여 영상의 댓글을 수집하여 데이터 프레임 및 csv로 저장
모든 댓글을 1개의 csv로 저장

1. 패키지 불러오기

# -*- coding:utf-8 -*-

  
## remove_blanks_right_left.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / remove_blanks_right_left.md
            
            
              Last active
              September 21, 2020 09:43
            
              
                왼쪽 오른쪽 공백 지우기
              
          
    import re

def clean_blank(text):
    
    cleaned_text = text.lstrip() #왼쪽 공백 제거
    cleaned_text = cleaned_text.rstrip() #오른쪽 공백 제거
 return cleaned_text```


## select_rows_contains_word.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / select_rows_contains_word.md
            
            
              Created
              May 14, 2020 02:12
            
              
                데이터 프레임에서 특정 단어 포함하는 행 선택하기
              
          
    df.loc[df_mapo['col_name'].str.contains('words', na= False)]


## sampling.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kse0202
                / sampling.md
            
            
              Created
              April 1, 2020 02:24
            
              
                불균형 데이터를 처리하기 위한 샘플링 종류
              
          
    UnderSampling


RandomUnderSampler: random under-sampling method
TomekLinks: Tomek’s link method
CondensedNearestNeighbour: condensed nearest neighbour method
OneSidedSelection: under-sampling based on one-sided selection method
EditedNearestNeighbours: edited nearest neighbour method
NeighbourhoodCleaningRule: neighbourhood cleaning rule

OverSampling


RandomOverSampler: random sampler