Significant Advances in Application Resiliency: The Data Engineering Perspective on Network Performance Metrics

Authors

  • Jayanna Hallur

DOI:

https://doi.org/10.47941/jts.2408

Keywords:

Data Engineering, Network Observability, Telemetry, Reliability, Resiliency, Scalability, FCAPS

Abstract

Purpose: The purpose of the article is to explore how network metrics like latency, packet loss, and throughput, combined with application logs, can help organizations improve the reliability and performance of their applications. It focuses on how these insights support Site Reliability Engineering (SRE) teams in proactively addressing issues to achieve better application resiliency, enhancing user experience and market trust.

Methodology: The article uses a combination of case studies and analysis to demonstrate how monitoring specific network metrics and application logs helps identify and resolve performance issues. It examines real-world scenarios where proactive adjustments based on these metrics improved application reliability and aligned with organizational objectives.

Findings: The findings show that by analyzing network metrics and application logs, organizations can pinpoint causes of transaction failures, such as high latency, packet loss, or misconfigured firewalls. Proactive resolutions based on these insights result in smoother application performance, reduced downtime, and increased user satisfaction.

Unique Contribution to theory, practice and policy: This article makes valuable contributions to theory, practice, and policy. For theory, it expands the understanding of how network metrics and application logs can work together to improve application resiliency, offering a framework for integrating Site Reliability Engineering (SRE) principles with network observability. For practice, it provides clear, actionable steps for SRE teams to identify and resolve performance issues, helping organizations enhance reliability and user satisfaction. For policy, it highlights the importance of proactive network monitoring and metric-driven decision-making, encouraging organizations to adopt policies that prioritize resiliency, ensure consistent performance, and meet service-level agreements (SLAs).

Downloads

Download data is not yet available.

Author Biography

Jayanna Hallur

Sr Lead Engineer in Observability Engineering and Data Architect

References

https://joindigital.com/naas360/network-observability

Jayanna Hallur, "The Future of SRE: Trends, Tools, and Techniques for the Next Decade", International Journal of Science and Research (IJSR), Volume 13 Issue 9, September 2024, pp. 1688-1698, URL: https://www.ijsr.net/getabstract.php?paperid=SR24927125336, DOI: https://www.doi.org/10.21275/SR24927125336

What Is Observability? Key Components and Best Practices https://www.honeycomb.io/what-is-observability/

Jayanna Hallur, "From Monitoring to Observability: Enhancing System Reliability and Team Productivity", International Journal of Science and Research (IJSR), Volume 13 Issue 10, October 2024, pp. 602-606, URL: https://www.ijsr.net/getabstract.php?paperid=SR241004083612, DOI: https://www.doi.org/10.21275/SR241004083612

Sasongko, H., & Hadiwandra, T. Y. (2021). Cloud-Based NAS (Network Attached Storage) Analysis as an Infrastructure as A Service (IAAS) Using Open Source NAS4FREE and Owncloud. https://doi.org/10.25299/itjrd.2022.5712

Xu, L., Jiang, W., Zhang, Q., & Wang, X. (2020). A comprehensive survey on bandwidth estimation techniques in high-speed networks. IEEE Communications Surveys & Tutorials, 22(3), 2038–2070. https://doi.org/10.1109/COMST.2020.3001035

Mehani, O., Boreli, R., & Henderson, T. (2015). Latency and loss measurements in mobile networks: Empirical evaluation of the state of the art. ACM SIGCOMM Computer Communication Review, 45(4), 25–30. https://doi.org/10.1145/2831347.2831351

Nagarajan, R., & Selvi, V. (2021). Resource utilization optimization in cloud-based network devices using deep learning. Journal of Network and Computer Applications, 186, 103095. https://doi.org/10.1016/j.jnca.2021.103095

Johnson, D., Kumar, R., & Nguyen, T. (2018). Dynamic load balancing and interface utilization in software-defined networking. Journal of Network and Systems Management, 26(4), 987–1002. https://doi.org/10.1007/s10922-018-9469-5

Ferrus, R., Sallent, O., & Agusti, R. (2014). QoS and jitter optimization in mobile multimedia networks. IEEE Transactions on Multimedia, 16(3), 759–768. https://doi.org/10.1109/TMM.2014.2299356

Scalability and Flexibility https://softechds.com/it-solutions/cloud-solutions/

IEEE Standards Association, "IEEE Standard for Information Technology - Telecommunications and information exchange between systems - Local and metropolitan area networks - Specific requirements," IEEE Std 802.11-2016, 2016

A. Brown, "Performance Monitoring in Modern Networks," Proceedings of the International Conference on Network Management, pp. 112-118, 2019.

Downloads

Published

2024-12-17

How to Cite

Hallur, J. (2024). Significant Advances in Application Resiliency: The Data Engineering Perspective on Network Performance Metrics. Journal of Technology and Systems, 6(7), 60–71. https://doi.org/10.47941/jts.2408

Issue

Section

Articles