Smart Data LifeCycle as a process cartography

By Mohammed EL ARASS, Nissrine SOUISSI

Abstract


Data management is becoming increasingly complex, especially with the emergence of the Big Data era. The best way to manage this data is to dispose a data lifecycle from creation to destruction. This paper proposes a new Data LifeCycle (DLC) named Smart DLC that helps to make from raw and worthless data to Smart Data in a Big Data context. In order to do this, we have followed a method which consists firstly in identifying and analyzing the lifecycles from a literature review, and then in defining the phases of our cycle and finally in modeling it. The cycle is modeled as a process cartography resulting from the ISO 9001: 2015 standard and the CIGREF framework to facilitate its implementation within companies. Smart DLC is qualified as a set of management, realization and support processes that could be addressed by an Information System urbanization approach. The advantage of modeling the phases such as processes is to be concerned not only with the technical activities but also with management, which is a major player for the success of the technique.

Full Text:

PDF

References


J. Manyika et al., ‘Big data: The next frontier for innovation, competition, and productivity’, 2011.

M. El arass, I. Tikito, and N. Souissi, ‘Data lifecycles analysis: towards intelligent cycle’, in Proceeding of The second International Conference on Intelligent Systems and Computer Vision, ISCV’2017, Fès 17-19 April, Fez, Morocco, 2017. https://doi.org/10.1109/ISACV.2017.8054938

N. Khan et al., ‘Big data: survey, technologies, opportunities, and challenges’, Sci. World J., vol. 2014, 2014.

A. Lenk, L. Bonorden, A. Hellmanns, N. Roedder, and S. Jaehnichen, ‘Towards a taxonomy of standards in smart data’, in Big Data (Big Data), 2015 IEEE International Conference on, 2015, pp. 1749–1754.

D. Farge, ‘Du Big data au smart data : retour vers un marketing de l'émotion et de la confiance’, LesEchos.fr, 2015.

F. Meleard, ‘Smart data, l'avenir du contenu’, Echosfr, 2015.

X. Ma, P. Fox, E. Rozell, P. West, and S. Zednik, ‘Ontology dynamics in a data life cycle: challenges and recommendations from a Geoscience Perspective’, J. Earth Sci., vol. 25, no. 2, pp. 407–412, 2014.

S. Allard, ‘DataONE: Facilitating eScience through collaboration’, J. EScience Librariansh., vol. 1, no. 1, p. 3, 2012.

IBM, ‘Wrangling big data: Fundamentals of data lifecycle management’, 2013.

S. BOUTEILLER, Enjeux business des données. Comment gérer les données de l'entreprise pour créer de la valeur? CIGREF, 2014.

L. Lin, T. Liu, J. Hu, and J. Zhang, ‘A privacy-aware cloud service selection method toward data life-cycle’, in Parallel and Distributed Systems (ICPADS), 2014 20th IEEE International Conference on, 2014, pp. 752–759.

S. Chaki, ‘The Lifecycle of Enterprise Information Management’, in Enterprise Information Management in Practice, Springer, 2015, pp. 7–14.

J. L. Faundeen et al., ‘The United States Geological Survey Science Data Lifecycle Model’, US Geological Survey, 2014.

J. B. Jade Reynolds, In the context of the Convention on Bilogical Diversity. World Conservation Monitoring Centre, 1996.

Y. Demchenko, C. De Laat, and P. Membrey, ‘Defining architecture components of the Big Data Ecosystem’, in Collaboration Technologies and Systems (CTS), 2014 International Conference on, 2014, pp. 104–112.

C. L. Borgman, J. C. Wallis, M. S. Mayernik, and A. Pepe, ‘Drowning in data: digital library architecture to support scientific use of embedded sensor networks’, in Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries, 2007, pp. 269–277.

E. Deelman and A. Chervenak, ‘Data management challenges of data-intensive scientific workflows’, in Cluster Computing and the Grid, 2008. CCGRID’08. 8th IEEE International Symposium on, 2008, pp. 687–692.

I. Gam, ‘Ingénierie des exigences pour les systèmes d’information décisionnels: concepts, modèles et processus: la méthode CADWE’, Paris 1, 2008.

W. L. Chang, ‘NIST Big Data Interoperability Framework: Volume 6, Reference Architecture’, Spec. Publ. NIST SP - 1500-6, Aug. 2017.

X. Yu and Q. Wen, ‘A view about cloud data security from data life cycle’, in Computational Intelligence and Software Engineering (CiSE), 2010 International Conference on, 2010, pp. 1–4.

A. Michota and S. Katsikas, ‘Designing a seamless privacy policy for social networks’, in Proceedings of the 19th Panhellenic Conference on Informatics, 2015, pp. 139–143.

Organisation internationale de normalisation, ‘Quality management systems requirements’. 2015.

CIGREF, ‘Les référentiels de la DSI : Etat de l’art usage s et bonnes pratiques’. 2009.

M. El arass, I. Tikito, and N. Souissi, ‘An Audit Framework for Data lifecycles in Big Data context’, in Proceeding of The International conference on selected topics in Mobile and Wireless Networking, Tangier, Morocco,2018.https://doi.org/10.1109/MoWNet.2018.8428883

M. El arass and N. Souissi, ‘Data Lifecycle: From Big Data to Smart Data’, in IEEE CiSt’18, Marrakech, Morocco, 2018.

K. Krishnan, Data warehousing in the age of big data. Newnes, 2013.

A. Reeve, Managing Data in Motion: Data Integration Best Practice Techniques and Technologies. Newnes, 2013.

M. Chen, S. Mao, and Y. Liu, ‘Big data: A survey’, Mob. Netw. Appl., vol. 19, no. 2, pp. 171–209, 2014.

K. Davis, Ethics of Big Data: Balancing risk and innovation. O’Reilly Media, Inc., 2012.

L. Pouchard, ‘Revisiting the Data Lifecycle with Big Data Curation’, Int. J. Digit. Curation, vol. 10, no. 2, pp. 176–192, May 2016.

L. Douglas, ‘3d data management: Controlling data volume, velocity and variety’, Gart. Retrieved, vol. 6, p. 2001, 2001.

P. Zikopoulos, C. Eaton, and others, Understanding big data: Analytics for enterprise class hadoop and streaming data. McGraw-Hill Osborne Media, 2011.

IBM, ‘The four v’s of big data’. .

I. Tikito and N. Souissi, ‘Data Collect Requirements Model’, in Proceeding of the BDCA’2017, Tetuan 28-30 March, 2017. https://doi.org/10.1145/3090354.3090358