2022 Analytics Project Tasks
Planned 2022 Research
The program includes a selection of interrelated and complementary research efforts that members help prioritize.
Definition and Data Models for Industrywide Overhead and Underground Distribution Asset: This task develops and updates the underlying data models for efficient and effective extraction, transfer, and loading of test, diagnostics, performance, and failure data for use in industry and utility database applications and performance analytics. The data model is used in the EPRI hosted Industrywide Asset Failure and Performance Database. Data models for distribution transformers (including surface-mount, underground, and network), underground cables, and wood poles will be reviewed. This task could also work with project funders to develop a comprehensive prioritized list of additional overhead and underground distribution assets for which data models may be developed in future years.
Industry-wide Overhead and Underground Distribution Performance and Failure Database: This task compiles and analyzes historical failure and performance data on overhead and underground distribution assets in a common format, using information gathered from participating utilities. Results will be distributed in the form of a Web accessible portal and database. The development of failure forms, data gathering, and analysis is expected to follow the same list of assets as in the data model development task.
Analytics for Fleet Management of Overhead and Underground Distribution Assets: This task investigates and develops performance assessment analytics for overhead and underground distribution assets, such as wood poles, underground cables, and distribution transformers (including surface-mount, underground, and network). The analytics are developed using data mining and analysis of periodic inspection results; failure modes and degradation research (carried out in other asset-focused projects in P180); subject matter expert experience; and other inputs, such as family, make, model, manufacturer, and operating environment. The research focuses on enhancements to algorithms and analytic methods for assets such as wood poles, underground cables, and distribution transformers.
Reliability and Resiliency Metrics and Analytics: Historical approaches to managing reliability and resiliency have included cyclical approaches and systemwide investment in hardening and other improvement options. This research task aims to use data analytics along with both traditional and new data sources to evaluate reliability and resiliency enhancement opportunities over the lifecycle of the power systems of interest. This unique approach will consider expected worst case severe weather exposure, climate change and other risk factors, thereby enabling utilities to identify and target investments that will yield benefits in the specific areas where the improvements will be the most impactful. Data for the analysis may come from multiple sources, including the historic storm event records, geospatial informatics systems, smart meter data, inspection data, and others. The analytics approaches used in this research effort enable members to understand historically successful leading practices and how new data sets can bring new insights to the reliability and resiliency challenge. This work will continue to curate successful analytics use cases and create documents that members will be able to leverage to replicate the concepts on their own systems.
Evaluating Costs and Benefits of Reliability/Resiliency Improvement Options: Investments in reliability and resiliency improvement can be difficult to quantify and to cost justify if utilities lack good information about the anticipated benefits. In prior years, EPRI research focused on developing and demonstrating methodologies for evaluating cost/benefits in support of reliability and resiliency improvement investments for specific assets. For 2022, EPRI will expand the focus to avoided cost analytics. The avoided cost analysis focuses on creating useful ratings for storm intensity and associated recovery costs. With these parameters well defined and documented, future reliability and resilience improvements can be quantified in terms of the investment versus the avoided future recovery costs. The work will develop replicable methods that utilities can leverage to guide their investment strategies and better inform the expected benefits to customers.
Geospatial Analytics and Insights: In 2020, EPRI began work with the Analytics Project Set Task Force to define and prioritize the specific use cases where geospatial analysis of distribution assets could be further developed into useful and insightful decision support outcomes. Specific opportunities leverage asset performance data, SCADA information, smart meter data and circuit by circuit power flow performance data. The 2020 and 2021 work demonstrated how this could benefit distribution automation analytics. In 2022 this research will prioritize and conceptualize the most beneficial distribution asset use cases that can be developed into geospatial story maps that visually represent the potential for enhanced asset analytics. All insightful and useful outcomes will be shared with project members by way of a Geospatial Analytics and Insights Guidebook for Distribution Assets. The intent is to keep this guidebook current and to grow the number of use cases annually.
Emerging Data Science Tools and Technologies (AI/ML/SD): This task investigates leading-edge R&D opportunities in the data science space with a focus on applicability to distribution system assets and reliability/resilience. Examples of the research include opportunities to leverage Artificial Intelligence (AI), Natural Language Processing (NLP) and the emergence of Synthetic Data (SD). Project set members benefit from this work by gaining first looks at each leading-edge proof of concept and first opportunities to host pilot implementations. The Project Set Task Force provides input and prioritization of new and emerging opportunities to better understand risks associated with system reliability and resilience performance. This prioritization has resulted in directives to investigate, AI use cases such as AI enabled at risk tree analytics leveraging satellite imagery, NLP applications for analyzing unstructured data, fault event electrical signature databases, and new applications of smart meter data. In 2022 EPRI will continue to offer the opportunity for task force advisors to propose topics for research within the project and those results will be added to the annual emerging data science tools and technologies guidebook.
Synthetic Data for AI Training: The effectiveness of AI systems is dependent on the quantity and quality of the datasets used for training. This can present difficulties for distribution applications, as the failure rate is relatively low compared to the size of the total population, which can result in difficulty building a data set that is large enough to train a model effectively while still remaining representative of the population. This research aims to investigate the options, approaches, and effectiveness of using synthetic data to augment real-world training data. This includes data generation as well as evaluation of AI model effectiveness for models trained using synthetic data.
Natural Language Processing: NLP systems continue to emerge as a tool with promise for distribution inspectors and maintenance personnel. Applications built using these systems are dependent on the ability to accurately detect and transcribe spoken words, which may contain industry jargon, colloquial terminology, and unique phrases for distribution equipment. This research aims to develop a dictionary of terms that could be incorporated into NLP systems to improve performance across distribution applications. Moving forward, the work is expected to progress to more advanced implementations that support distribution asset analytics use cases.
For more information on the 2022 plan for asset, reliability & resiliency analytics, contact Doug Dorr or Bhavin Desai