Significant Advances in Application Resiliency: The Data Engineering Perspective on Network Performance Metrics
DOI:
https://doi.org/10.47941/jts.2408Keywords:
Data Engineering, Network Observability, Telemetry, Reliability, Resiliency, Scalability, FCAPSAbstract
Purpose: The purpose of the article is to explore how network metrics like latency, packet loss, and throughput, combined with application logs, can help organizations improve the reliability and performance of their applications. It focuses on how these insights support Site Reliability Engineering (SRE) teams in proactively addressing issues to achieve better application resiliency, enhancing user experience and market trust.
Methodology: The article uses a combination of case studies and analysis to demonstrate how monitoring specific network metrics and application logs helps identify and resolve performance issues. It examines real-world scenarios where proactive adjustments based on these metrics improved application reliability and aligned with organizational objectives.
Findings: The findings show that by analyzing network metrics and application logs, organizations can pinpoint causes of transaction failures, such as high latency, packet loss, or misconfigured firewalls. Proactive resolutions based on these insights result in smoother application performance, reduced downtime, and increased user satisfaction.
Unique Contribution to theory, practice and policy: This article makes valuable contributions to theory, practice, and policy. For theory, it expands the understanding of how network metrics and application logs can work together to improve application resiliency, offering a framework for integrating Site Reliability Engineering (SRE) principles with network observability. For practice, it provides clear, actionable steps for SRE teams to identify and resolve performance issues, helping organizations enhance reliability and user satisfaction. For policy, it highlights the importance of proactive network monitoring and metric-driven decision-making, encouraging organizations to adopt policies that prioritize resiliency, ensure consistent performance, and meet service-level agreements (SLAs).
Downloads
References
https://joindigital.com/naas360/network-observability
Jayanna Hallur, "The Future of SRE: Trends, Tools, and Techniques for the Next Decade", International Journal of Science and Research (IJSR), Volume 13 Issue 9, September 2024, pp. 1688-1698, URL: https://www.ijsr.net/getabstract.php?paperid=SR24927125336, DOI: https://www.doi.org/10.21275/SR24927125336
What Is Observability? Key Components and Best Practices https://www.honeycomb.io/what-is-observability/
Jayanna Hallur, "From Monitoring to Observability: Enhancing System Reliability and Team Productivity", International Journal of Science and Research (IJSR), Volume 13 Issue 10, October 2024, pp. 602-606, URL: https://www.ijsr.net/getabstract.php?paperid=SR241004083612, DOI: https://www.doi.org/10.21275/SR241004083612
Sasongko, H., & Hadiwandra, T. Y. (2021). Cloud-Based NAS (Network Attached Storage) Analysis as an Infrastructure as A Service (IAAS) Using Open Source NAS4FREE and Owncloud. https://doi.org/10.25299/itjrd.2022.5712
Xu, L., Jiang, W., Zhang, Q., & Wang, X. (2020). A comprehensive survey on bandwidth estimation techniques in high-speed networks. IEEE Communications Surveys & Tutorials, 22(3), 2038–2070. https://doi.org/10.1109/COMST.2020.3001035
Mehani, O., Boreli, R., & Henderson, T. (2015). Latency and loss measurements in mobile networks: Empirical evaluation of the state of the art. ACM SIGCOMM Computer Communication Review, 45(4), 25–30. https://doi.org/10.1145/2831347.2831351
Nagarajan, R., & Selvi, V. (2021). Resource utilization optimization in cloud-based network devices using deep learning. Journal of Network and Computer Applications, 186, 103095. https://doi.org/10.1016/j.jnca.2021.103095
Johnson, D., Kumar, R., & Nguyen, T. (2018). Dynamic load balancing and interface utilization in software-defined networking. Journal of Network and Systems Management, 26(4), 987–1002. https://doi.org/10.1007/s10922-018-9469-5
Ferrus, R., Sallent, O., & Agusti, R. (2014). QoS and jitter optimization in mobile multimedia networks. IEEE Transactions on Multimedia, 16(3), 759–768. https://doi.org/10.1109/TMM.2014.2299356
Scalability and Flexibility https://softechds.com/it-solutions/cloud-solutions/
IEEE Standards Association, "IEEE Standard for Information Technology - Telecommunications and information exchange between systems - Local and metropolitan area networks - Specific requirements," IEEE Std 802.11-2016, 2016
A. Brown, "Performance Monitoring in Modern Networks," Proceedings of the International Conference on Network Management, pp. 112-118, 2019.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Jayanna Hallur
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution (CC-BY) 4.0 License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.