Angelo Ramírez Ortega (@angeloortega)
  • Stars
    star
    3
  • Global Rank 1,854,108 (Top 64 %)
  • Followers 12
  • Following 21
  • Registered over 7 years ago
  • Most used languages
    C
    33.3 %
    Java
    33.3 %
    Python
    33.3 %
  • Location 🇨🇷 Costa Rica
  • Country Total Rank 667
  • Country Ranking
    C
    49
    Java
    129
    Python
    211

Top repositories

1

DSM

Distributed Shared Memory programs written in C
C
1
star
2

Yoogling

This projects intends to index a huge amount of files and accessing those files upon a search like a search engine by using term frequency (TF) and inverted document frequency (IDF) to help determine how important are the words in a text and which words give more information about the text. To be able to find the most relevant texts upon a given search query. This project indexes million of archives and stores the a file of the indexed archives via the TF-IDF algorithm. It stores the information in files because of the huge amount of data it is managing. When the query is made it searches upon the file with the indexed terms to look for the document that has the highest TF-IDF for the words in the query. By managing data in files we achieve the fastest solution and avoid overcrowding the main memory that would happen if everything is kept under variables in the project. The information is written to the indexed file partially to avoid giving to much data to main memory that would slow down the indexing process. Instead of indexing all the files and then writing it to the specific file, every given amount of files all the data is written and the main memory is cleared so everything is kept as optimal as possible. The goal is to achieve a program that indexes huge amounts of data and searches for the most relevant files upon a query in the most effective way by using indexing algorithms like TF-IDF and query solving algorithms like cosine similarity.
Python
1
star
3

Progra1_POO

ITCR Angelo Ramirez Ortega -Eduardo Jiron - Alejandro Carmona
Java
1
star