@techykajal
Created August 28, 2021 13:36
# Total number of tokens in the corpus
# (assumes each entry of DF['Updated_content'] is already a list of tokens)
tokens = []
for article in DF['Updated_content']:
    for word in article:
        tokens.append(word)
len(tokens)  # The corpus contains about 150,000 tokens in total
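
The same count can be tried on toy data. This is only a minimal sketch: `toy_corpus` is a hypothetical stand-in for `DF` (a plain dict, so it runs without pandas), and it assumes each article is already a list of tokens. If the articles were raw strings, the inner loop above would iterate over characters rather than words.

```python
from itertools import chain

# Hypothetical stand-in for DF: one already-tokenized article per entry.
toy_corpus = {
    "Updated_content": [
        ["data", "science", "is", "fun"],
        ["data", "tokens", "are", "counted"],
    ]
}

# Flatten the per-article token lists into one corpus-wide list,
# equivalent to the nested for-loop with tokens.append(word).
toy_tokens = list(chain.from_iterable(toy_corpus["Updated_content"]))

total_tokens = len(toy_tokens)        # every token, repeats included
unique_tokens = len(set(toy_tokens))  # distinct tokens (vocabulary size)
```

`chain.from_iterable` avoids the explicit nested loop; the nested loop is equally valid and may be easier to read.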