Pham Trung Viet

v2.0.1

EN

Dark
Light

Liberate

An online library that is fully equiped with distributed analytics cluster
MySQL MongoDB Python React NodeJS Bash Hadoop AWS


About the Project:

Liberate is a fullstack webapp that stores books information and reviews. It has automated scaling mechanism for its database which utilise both MongoDB and MySQL to stores books' information and review. The system also employs Hadoop and PySpark for analytic tasks. All of the system are instantiated and hosted on AWS EC2 instances.

Application main screen

Features:

  • Automatic spinning up and tearing down of the entire system and automatic scaling of hadoop analytic clusters. The automation was achieved using python and bash script during instance instantiation.
  • The Web Application allows user to filter and browse through available books, add new books and post review of the books.
  • The analytic system using PySpark and Hadoop can output tf-idf on the review and pearson correlation between price and average review length periodically.

  • System network diagram - from 50.043 Course Website

    This project was made for 50.043 Database and Big Data course together with 4 more members.


    Liberate search screen

    Liberate book display screen