  • Stars: 2
  • Language: R
  • Created over 4 years ago
  • Updated over 3 years ago

Repository Details

This Data Warehouse and Business Intelligence project combines structured and unstructured data sources on tuberculosis incidence and mortality rates for 197 countries around the world. The aim was to analyze trends in tuberculosis mortality and incidence rates across those countries. Data was web scraped, cleansed, and loaded through an ETL process into a star schema, on which an OLAP cube was deployed and non-trivial BI queries were run. First, the data was extracted, cleaned, and transformed in R, then loaded into SQL Server through SSMS, where the dimension tables were created using an insert query task. Kimball's bottom-up approach was used to design the star schema in SSIS. Finally, the cube was deployed in SSAS, and Tableau was used for visual analytics to create dashboards. Technologies used: MS SQL, SQL Server Integration Services (SSIS), SQL Server Analysis Services (SSAS), Tableau. A video of the execution with explanation is available.
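
To make the star-schema idea concrete, here is a minimal sketch of one fact table with two dimension tables and a simple roll-up query. It is not the project's code: the project used R, SSIS, and SQL Server, whereas this sketch uses Python's built-in `sqlite3` as a stand-in so it runs anywhere. The table names (`dim_country`, `dim_year`, `fact_tb`), the column names, and the sample rows are all hypothetical placeholders, not values from the project's data.

```python
import sqlite3


def build_star_schema(conn):
    """Create two dimension tables and one fact table (hypothetical names)."""
    conn.executescript("""
        CREATE TABLE dim_country (
            country_key  INTEGER PRIMARY KEY,
            country_name TEXT NOT NULL UNIQUE
        );
        CREATE TABLE dim_year (
            year_key INTEGER PRIMARY KEY,
            year     INTEGER NOT NULL UNIQUE
        );
        CREATE TABLE fact_tb (
            country_key        INTEGER NOT NULL REFERENCES dim_country(country_key),
            year_key           INTEGER NOT NULL REFERENCES dim_year(year_key),
            incidence_per_100k REAL,
            mortality_per_100k REAL,
            PRIMARY KEY (country_key, year_key)
        );
    """)


def load_rows(conn, rows):
    """Insert dimension members first, then the fact row that references them."""
    for country, year, incidence, mortality in rows:
        conn.execute(
            "INSERT OR IGNORE INTO dim_country (country_name) VALUES (?)",
            (country,),
        )
        conn.execute("INSERT OR IGNORE INTO dim_year (year) VALUES (?)", (year,))
        # Look up the surrogate keys and write the measures in one statement.
        conn.execute(
            """
            INSERT INTO fact_tb
                (country_key, year_key, incidence_per_100k, mortality_per_100k)
            SELECT c.country_key, y.year_key, ?, ?
            FROM dim_country AS c, dim_year AS y
            WHERE c.country_name = ? AND y.year = ?
            """,
            (incidence, mortality, country, year),
        )
    conn.commit()


def mean_mortality_by_year(conn):
    """Roll the fact table up along the year dimension."""
    cursor = conn.execute(
        """
        SELECT y.year, AVG(f.mortality_per_100k)
        FROM fact_tb AS f
        JOIN dim_year AS y ON y.year_key = f.year_key
        GROUP BY y.year
        ORDER BY y.year
        """
    )
    return cursor.fetchall()


# Placeholder rows invented for illustration only; not real tuberculosis figures.
SAMPLE_ROWS = [
    ("Country A", 2000, 100.0, 10.0),
    ("Country B", 2000, 200.0, 30.0),
    ("Country A", 2001, 90.0, 8.0),
]

conn = sqlite3.connect(":memory:")
build_star_schema(conn)
load_rows(conn, SAMPLE_ROWS)
result = mean_mortality_by_year(conn)
print(result)
```

Loading dimensions before facts mirrors the order described above, where dimension tables are populated first and the fact table then references them by key. An OLAP cube such as the one deployed in SSAS would precompute aggregations like this roll-up across several dimensions at once.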