Project detail

Project information

  • Category: Data Processing and Cloud
  • Tools: AWS, Python, Tableau
  • Project URL: GitHub

Developed a data pipeline that collects data from sources including Twitter, Common Crawl, and The New York Times, then processes it with AWS MapReduce to solve the word count and word co-occurrence problems. Built a word cloud in Tableau to visualize the output of each program.
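
As a rough illustration of the two MapReduce jobs described above, here is a minimal local sketch in Python. It is not the project's actual code: the function names, the sample lines, and the choice to count co-occurrence as unordered pairs of distinct words within the same line are all assumptions, and the map, shuffle, and reduce phases are simulated in memory rather than run on AWS.

```python
import re
from collections import defaultdict
from itertools import combinations

# Assumed tokenization: lowercase alphabetic words (apostrophes allowed).
TOKEN_RE = re.compile(r"[a-z']+")


def tokenize(line):
    """Split a line of text into lowercase word tokens."""
    return TOKEN_RE.findall(line.lower())


def map_word_count(line):
    """Mapper for word count: emit (word, 1) for every token."""
    for word in tokenize(line):
        yield word, 1


def map_cooccurrence(line):
    """Mapper for co-occurrence ("pairs" approach, assumed here):
    emit ((w1, w2), 1) for each unordered pair of distinct words in a line."""
    words = sorted(set(tokenize(line)))
    for pair in combinations(words, 2):
        yield pair, 1


def reduce_sum(key, values):
    """Reducer shared by both jobs: sum the counts for one key."""
    return key, sum(values)


def run_job(lines, mapper, reducer):
    """Simulate map -> shuffle (group by key) -> reduce in memory."""
    groups = defaultdict(list)
    for line in lines:
        for key, value in mapper(line):
            groups[key].append(value)
    return dict(reducer(key, values) for key, values in sorted(groups.items()))


# Hypothetical sample input standing in for the collected text.
SAMPLE = ["big data on the cloud", "the cloud stores big data"]

word_counts = run_job(SAMPLE, map_word_count, reduce_sum)
pair_counts = run_job(SAMPLE, map_cooccurrence, reduce_sum)
```

The resulting dictionaries map each word (or word pair) to a count, which is the kind of table a word cloud in Tableau could be sized by. On a real cluster, the mapper and reducer would typically read from standard input and write tab-separated key/value lines instead of being called directly.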