Using language-based search in mining large software repositories

Language component plays an important role in data/information retrieval. Data retrieval in software engineering is often hindered by the difficulty of getting data from commercial software. The emergence of the open source repositories has contributed tremendously in the collection of software da...

Full description

Bibliographic Details
Main Author: Awang Abu Bakar, Normi Sham
Format: Conference or Workshop Item
Language:English
Published: 2011
Subjects:
Online Access:http://irep.iium.edu.my/8451/
http://irep.iium.edu.my/8451/
http://irep.iium.edu.my/8451/1/PACLING_AwangAbuBakar.pdf
id iium-8451
recordtype eprints
spelling iium-84512011-12-20T05:51:21Z http://irep.iium.edu.my/8451/ Using language-based search in mining large software repositories Awang Abu Bakar, Normi Sham QA75 Electronic computers. Computer science Language component plays an important role in data/information retrieval. Data retrieval in software engineering is often hindered by the difficulty of getting data from commercial software. The emergence of the open source repositories has contributed tremendously in the collection of software data. This paper highlights the data retrieval method for mining software from a vast open source software repository, SourceForge. For the purpose of automating the data retrieval from the repository, a parser was written using the Python programming language, and based on the pattern matching algorithm. The retrieved data were later used to estimate the quality of the open source software. 2011-12-17 Conference or Workshop Item PeerReviewed application/pdf en http://irep.iium.edu.my/8451/1/PACLING_AwangAbuBakar.pdf Awang Abu Bakar, Normi Sham (2011) Using language-based search in mining large software repositories. In: Pacific Association for Computational Linguistics (PACLING 2011), 19-21 July 2011, Kuala Lumpur. http://www.sciencedirect.com/science/article/pii/S1877042811024219
repository_type Digital Repository
institution_category Local University
institution International Islamic University Malaysia
building IIUM Repository
collection Online Access
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Awang Abu Bakar, Normi Sham
Using language-based search in mining large software repositories
description Language component plays an important role in data/information retrieval. Data retrieval in software engineering is often hindered by the difficulty of getting data from commercial software. The emergence of the open source repositories has contributed tremendously in the collection of software data. This paper highlights the data retrieval method for mining software from a vast open source software repository, SourceForge. For the purpose of automating the data retrieval from the repository, a parser was written using the Python programming language, and based on the pattern matching algorithm. The retrieved data were later used to estimate the quality of the open source software.
format Conference or Workshop Item
author Awang Abu Bakar, Normi Sham
author_facet Awang Abu Bakar, Normi Sham
author_sort Awang Abu Bakar, Normi Sham
title Using language-based search in mining large software repositories
title_short Using language-based search in mining large software repositories
title_full Using language-based search in mining large software repositories
title_fullStr Using language-based search in mining large software repositories
title_full_unstemmed Using language-based search in mining large software repositories
title_sort using language-based search in mining large software repositories
publishDate 2011
url http://irep.iium.edu.my/8451/
http://irep.iium.edu.my/8451/
http://irep.iium.edu.my/8451/1/PACLING_AwangAbuBakar.pdf
first_indexed 2023-09-18T20:18:10Z
last_indexed 2023-09-18T20:18:10Z
_version_ 1777407928169398272