The recent increase in new transportation data sources coupled with the enhanced densities of data Sentiment Analysisgraphicsources are leading to a number of new analytics tools for transportation professionals.  One of the latest analysis resources to recently emerge fuses real-time/near-real time social data with traditional mobility data (speed, location, trajectory).  Human sensor networks wield a much more comprehensive ability to not only report on traditional mobility data attributes, but also provides additional resolution for location and time, as well as characteristics with regards to “sentiment”.

Sentiment analysis  aggregates and filters real-time data from the web and social media resources and reduces the data for context and transportation value.  Sentiment mapping links data sources with location, or future locations with detected sentiment related to each location.     For example, data crawlers and filtering for key words (accident, crash, traffic, I-66, I-95) and assessing sentiment tied to messages (slow, clear, backup, bad, good).  The resulting sentiment model is then tied to time and location data for a resulting sentiment graph.  Sentiment mapping and sentiment analysis can also be utilized for predictive analytics where content is determined to identify future location-based sentiment for future conditions.

The following links provide a primer as well as preliminary information on development and configuration of sentiment data models.

References and Resources

How Social Media Can Improve and Redesign Transport Systems
http://www.theguardian.com/sustainable-business/social-media-redesign-transport-systems-cities
Transportation Sentiment Analysis for Safety Enhancement
http://utc.ices.cmu.edu/utc/CMU%20Reports%202013%202/Final%20Report%20Chen.pdf
Creating a Sentiment Analysis Model
https://developers.google.com/prediction/docs/sentiment_analysis
Sentiment Analysis
http://en.wikipedia.org/wiki/Sentiment_analysis
How Smart Cities are Using API’s
http://www.programmableweb.com/news/how-smart-cities-are-using-apis-public-transport-apis/2014/05/22

A lot has been written about “big data” lately.  The rapid growth of varying data sources coupled with the enhanced density in data sources is establishing  a huge resource for transportation operators.  The rapid proliferation of data sources from new devices such as smartphones and other newly connected devices, in conjunction with the advancement of technologies for data collection and management have manifested a sizeable inflection point in the availability of data.  So what does this mean for ITS operators and the systems they currently manage?  What will be required to extract and leverage values associated with “big data”?

At First Glance

Federal regulations for performance measures and real-time monitoring associated with MAP-21 and 23 CFR 511 have implemented a framework for the increased need of new, refined data and information systems.  System enhancements will require improvements to existing networks and communications systems in order to optimize data and metadata flows between data sources and central applications. Robust central network equipment, including L3 switches, servers and storage will also be required.  Enhanced security measures  associated with new data sources and big data values will also need to be reviewed and attended to.  New central data warehouse infrastructure will also be required, including new database applications (such as Hadoop), that are capable of managing “big data” and the “Internet of Things” (IoT).

Deeper Dive

A closer look reveals additional layers of change required in order to begin abstracting value from the new data sources.  “Big data” will also require somewhat less obvious changes in the way transportation agencies currently do business.

Increased Data Management and Analytics Expertise –  The new data paradigm will require new staff skills, most notably, experience in data analytics (Quants).  Staff skills will not only require knowledge of the data available now or potentially available in the near term, but also understand transportation systems in order to apply the most beneficial data mining tactics available.  The new role must not only be aware of current data and information needs and values, but also be cognizant of what is capable, and potential hidden values currently unrealized or unknown by an operating agency.  The new role will also be an integral part of the development of embedded system features and be able to identify nuances in data meaning, as well as establish effective predictive analytics.

Policy and Digital Governance –  New data sources are also giving rise to discussion regarding privacy and liability.  Data sourced from private entities will always contend with privacy fears and concerns, at least for the near term, although recent analysis is showing a steady lessoning of those fears as “digital natives” begin to represent a greater percentage of the traveling public.  Data generated from sources outside of transportation agencies, but utilized by transportation agencies  for systems operations, can lead one to question who is responsible should data errors occur that might affect a system.

Networks and Communications - Data sources, formats and general data management practices will need extensive review of existing conditions. What values are attained from real-time, or near real-time collection from subsequent analytics, as well as determining what data is less time dependent.  Existing formats and protocols should also be included in the mapping exercise. For example, CV will require a mandatory upgrade of IP protocols from IPv4 to IPv6.  General planning regarding the utilization of “the cloud” need to be weighed for benefit-cost.  Third-party data brokers and other outsourcing alternatives such as cloud computing need to also be assessed.

Data Management and Analysis Tools – Operating entities also need to look at implementing data management tools (applications) that will assist in extracting value from large data sets.  These tools  should be integrated with core systems, and provide real-time metrics of collected data.  The tools should also provide the ability for “Cloud collaboration”, in order to process data stored by third parties, or general data stored in the cloud.

Wisdom Knowledge Information Data Pyramid

What to do

Transportation budgets are as tight as ever. How can operating agencies begin to make incremental steps towards the goal of realizing benefits associated with “big data”?  The first step is to begin now.  Start by mapping existing data sources to existing data management technologies, policies and processes, from end to end.  Also, widen your perspective and begin to look at possible benefits from a wide array of new data sources.  In addition, “open” it up, and benefit from the wisdom of the crowd.  New analytics skill sets should be considered a condition of certain new hires in the transportation and ITS planning departments.  A staff member should be designated for leading the way with decisions regarding “big data”, relationships with third party data brokers, cloud management, as well as be responsible for implementing an agile framework for next-gen data systems.

References and Resources
Developing a Data Management Program for Next-Gen ITS: A Primer for Mobility Managers
Big Data and Transport
TransDec: Big Data for Transportation
Update from the Data Liberation Front                                                                                           

The ITS planner, designer and operator should always be cognizant of the life-cycle of the overall system and its integrated subsystems and components. Timing of next-gen ITS integration can be optimized, both fiscally as well as technically, by considering the wide spectrum of variables associated with life-cycle management.  The following graphic presents a general overview of a typical systems life-cycle:

Life-Cycle_Management

Typical life-cycle management should also include evaluation of the maturity of next-gen ITS technologies and the systems required to support a new ITS solution.

techadoptcurve

With the pending market saturation of the smartphone looming, and the emergence of connected vehicles, peer-to-peer resource management, crowdsourcing and the implementation of collaborative platforms, one could easily surmise that the “consumerization” of significant components of Next-Gen Intelligent Transportation Systems (ITS) is well underway. What is not exactly clear is what the landscape will look like during the transition, as well as when consumerization is firmly rooted.

It’s clear that public mobility managers will continue to provide certain services to their constituents for the foreseeable future, however  it is expected that some existing services will be provisioned through consumerization. Consumerization will also give rise to entirely new service needs. New areas of expertise will be required for data and information management, systems management and X2X networks, to name just a few. Will consumerization lead to less strain on agency coffers? Or will it simply generate new needs equal to or greater than existing financial burdens? We’ll take deeper dives on these issues in coming posts.

consumerization

References and Resources
http://blog.gardeviance.org/2011/03/consumerization.html

Sources of Innovation

Posted: May 15, 2013 in Innovation, Planning
Tags: , ,

One of my favorite aspects about the technology industry is that the “next big thing” can come from just about anywhere and at anytime.  Intelligent Transportation Systems (ITS) envelopes a significant array of core industries, and feeds off of quite a few other innovation ecosystems.  However, monitoring and tracking these future trends and relevant upstream currents can be challenging at best.

One of the first tools I developed was an innovation resource matrix, which attempts to map core and key industries and innovation resources integrated with or tangent to the ITS industry. An example of this first generation map is provided in the following graphic. In later posts we’ll examine philosophies and strategies for managing and navigating the hype, fog and moats associated with trends analysis and future-casting.

Capture

There is no denying “big data” and its importance to next-gen ITS applications. The emergence of a vast,data omnipresent data cloud is enabling new knowledge and wisdom to be attained, as well as facilitate new operations models for the mobility manager.  Unfortunately, parochial data systems and data management strategies are quickly becoming obsolete with regards to managing this quickly evolving paradigm.  As a result, the need for operating institutions and mobility managers to understand “big data” and implement new, comprehensive and overarching data management strategies has never been greater.  Next-gen data and information systems will need to be autonomous, contextual, predictive and real-time.  The overall impact is cascading in that now a new strategy is not only desired, but will become an essential function, as the proliferation of meaningful data sources accelerates.  The time for agencies to plan, prepare, implement and transition is now.  The following aggregates a few thoughts into an introductory package for agencies to consider as they get started, in hopes of widening the road to success.

THE NEED

Although all of the values of new “big data” resources are not yet fully understood, the danger of getting bogged down in the data deluge is already being felt.  Before these new values can be leveraged, we must first review, research and retool, predicated on a sound understanding of existing conditions and extensive research and evaluation of likely future conditions and future capabilities. In addition, programmatic and industry changes such as MAP-21 and the Connected Vehicle are changing the operational fabric and are mandating new requirements for mobility managers, and thus, also need to be considered when developing a new data and information management strategy.

WHAT / HOW MODEL

So where to start? – The following insights are framed within the “What/How” solutions model, or “What do we want/need?”, and then,”How do we do it?”  As is the case with all sound planning efforts, an accurate understanding of existing conditions is an essential first-step prior to commencing with future planning efforts.

data2

What

Stakeholders and Champions – The first step is to identify all possible stakeholders (including champions and arbiters), both internal and external to an operating entity.  It’s key to remember that the data paradigm shift will cover all departments, agencies, programs and offices within a city and/or region, therefore coordination with an overarching perspective is essential for success.  Typically non-traditional stakeholders will now play important roles and become key teammates.  The identification of the initial list of stakeholders should include a first draft of a new steering committee or “Data Management Team” (DMT), which should encompass all pertinent agencies and institutions.

What do we have?

Following the formation of DMT, the team should begin to assess existing conditions.  Some key questions to get started include:

  • What are our existing data generators?
  • What systems are required to support these data generators?
  • How do we currently source, transmit and aggregate data from existing data sources?
  • What data and information-based goals and objectives are currently in place?
  • What are our existing processes for measuring and monitoring the path towards prescribed goals?
  • What values are we realizing/not realizing?
  • What standards and formats do we utilize?
  • What policies and regulations currently exist?
  • What quality control processes and procedures are in place?
  • What licensing, warranty and policy factors impact our data and information systems?

These questions will likely uncover significant new understanding as to how an agency currently handles data, and identify opportunities lost or new opportunities for functional improvements. The baseline assessment needs to include identification and mapping of existing supporting systems and infrastructure, including networking and software applications.  The exploration should also begin to drill down and refine existing information such as data attributes. A list of attributes might include:

Data

  • Source
  • Owner
  • Use rights
  • Format
  • Polling rates
  • Current uses (realized)
  • Potential uses (unrealized)
  • Quality
  • Cleansing/conditioning

Data Support Systems and Applications

  • Infrastructure requirements
  • Software dependencies
  • Other OSI reference model considerations

Policies, Guidelines and Contracts

  • Use policies
  • Cost per byte/poll
  • Licensing and Warranties
  • Existing vendor contracts, limitations
  • Storage and Retrieval
  • Performance metrics and monitoring
  • Existing staff requirements

Interim Review – Immediately following initial exploration of existing conditions, the Data Management Team should conduct an interim review of its findings. In addition, the DMT should review any and all existing goals and objectives related to data and information systems. What are we truly trying to accomplish and what are we achieving? What are we not achieving? What are the perceived initial gaps?  The initial review of existing conditions will likely trigger additional exploration needs with regards to existing data and information systems.  The interim review will also likely uncover additional stakeholders, both internal and external to the mobility management ecosystem.

Mapping – Map your findings.  As with all good wayfinding processes, a “you are here” marker is essential.  The goal is to map all exploration activities and contextualize the existing data and information system landscape.  In addition to narrative and graphical mapping, a spreadsheet or database is also helpful for tracking results such as data and information attributes.

Projections and Forecasts

data4The next step will be to begin exploration and research of existing trends and to conduct forecasting of future trends and forecasted conditions.  Predicting the future is always challenging at best.  However, with a sound, comprehensive strategy in place, an organization can best plan and implement strategies that prepare an agency for potential future conditions.  Trends analysis and future conditions forecasting will assist in establishing a pragmatic orientation for the foreseeable future.  These assessments should be conducted in parallel, yet separate paths from the existing conditions exploration and mapping tasks.  (The simultaneous work efforts will assist in finalizing the existing conditions survey task by uncovering additional gaps in the initial existing conditions survey and identify additional existing conditions research required).

Current Trends – Current trends such as cloud-computing, smartphones, mobile apps, private data sourcing, crowdsourcing, and integrated corridor management (ICM) need to be identified and included in new data management strategies. MAP-21 and other Federal requirements will mandate a new minimum acceptance level for the operating entities and also need to be immediately included in planning efforts.  It’s important to look past today’s sheen of certain applications and technologies to truly understand where industries and agencies are headed.

Future Trends – Connected Vehicle, including V2X, or V2I components will directly impact operating agencies and the way they do business in the coming years. Other likely future trends such as the autonomous vehicles, City as a Platform and integration of transportation networks will directly impact the data and information framework.  Additional trends such as system automation and data driven systems will amplify the need for pertinent real-time data.

Research

The “Future-Casting” task should also assign segments of industry to in-house champions (domain expertise), in order to monitor federal regulations, funding streams, the information technology and automobile sectors, university, state and federal research tracks, consumer technology markets, as well as tangential markets and adjacent internal agencies and divisions.

What do we want/need?

Immediately following the initial existing conditions survey and research and forecasting of future trends and conditions, the DMT should revisit original goals and objectives regarding data and information systems, and modify/append accordingly.  At this point, a traditional “User Needs and Preferences” assessment can be conducted, and should follow a traditional Systems Engineering framework. Some of the basic questions to address include:

  • Have we properly identified and defined all of our goals and objectives
  • How do you plan to leverage enriched data environments?
  • How will this foster enhanced wisdom and adaptive genius within our mobility ecosystem?
  • How will me monitor our progress towards achieving our goals and objectives (performance measures)
  • Have we instituted agency changes appropriate and sufficient to meet our goals and objectives?

To this point, you should have a pretty sound understanding of all of the existing data and information systems within the agency/region.    However, it may require additional iterations of the exploration, mapping and wants and needs assessments to truly understand where you are, and where you want to be (goals).

How

Once goals and objectives have been set, we can begin to assess “How” do we get there?  As with most planning efforts, an alternatives analysis and a Long Range Plan and Implementation Plan need to be developed.  A scale vs. value and ROI assessment is conducted at this point as well.  As is always the case with future-proofing, the key is not to plan to design for specific (undefined, and in some cases unknown) technologies, methodologies and strategies, but to identify and anticipate likely future conditions and implement a framework that is agile, flexible and capable of embracing future technologies, strategies and methodologies.

data3The next step is to establish a requirements-based blueprint and roadmap to transition from today to tomorrow. It’s also important to set measurable goals and identify necessary performance metrics in order to track progress towards goals and objectives, and to be able to conduct evaluatory assessments.  This step should also include a traditional gap analysis as well.  The Long Range Plan should also include a Concept of Operations.  This step will also begin to define “rewiring” necessary for executing the new data and information management program, which should also include business rules.  In addition, new data management schema needs to be integrated with the overall (typical) planning processes, including budgeting, long-range plans and regional plans.

Staffing resources and annual operations should also be assessed at this point.  Domain expertise, staffing and skills requirements will need to be addressed.  This should be included in the initial existing conditions exploration.  A new Data Manager position is likely the most appropriate first hire.  This individual may be an MPO, DOT or local agency staff person in charge of overseeing all harmonization of data and information systems across all platforms, jurisdictional and agency boundaries.  A Data Scientists/Analysts will also likely be required.

Additional Challenges and Potential Impediments to Consider

Initial Buy-in and Engagement – As with most new initiatives, getting up from the “comfy couch” can be the biggest challenge to implementing new or improved strategies.  Generating the initial inertia and momentum will require champions at the administrative, technical and arbiter levels, within all stakeholders, departments, agencies and regional staff (MPO).

Data use and retention policies – some data may be approved for certain uses, however, additional uses may raise privacy, licensing or ownership issues.  This challenge also gives rise to additional hurdles including operational governance and regulation of the new data and information system.  For example, can private data be sourced to operate public systems (signal systems, etc.) were safety is critical?

Integration and Standardization – what level of data and system integration is optimal, or will achieve the greatest Benefit/Cost ratio for an operating entity? What granularity and resolution (data density) is required for each component of the goals?  Automated monitoring and performance reporting will be a key to success with regards to overall integration and standardization.

Sustainability – A new funding stream (outgoing) is likely required.  However, the potential for additional revenue streams (incoming) is also likely.  Funding needs to be identified for the initial capital outlay, as well as annual operations and maintenance cost for the life-cycle of the system and subsystems.

Security – As the data reservoir expands, and the network to support and manage the data and information systems expands, so will the security concerns.  New policies and data management applications will be essential. Data storage, encryption, access rights, use rights as well as infrastructure and support applications should all be included in the initial security assessment and security planning efforts.

RESOURCES
Transportation Data and Information Systems – LinkedIn Working Group
http://www.linkedin.com/groups?gid=4929972&trk=myg_ugrp_ovr
USDOT Research Data Exchange
http://www.its.dot.gov/assetviewer/
Research, technology, and data drive America’s transportation system – USDOT Transportation Secretary
http://fastlane.dot.gov/2013/03/researchg.html
Real-Time Data Capture and Management
http://www.its.dot.gov/data_capture/data_capture.htm

I originally investigated the potential synergies and resultant barriers and challenges for integrating public/commercial grade wireless communications in support of the “Connected Vehicle”, back in 2007. (then known as “Intellidrive”)  A brief summary of the initial findings and a proposed architecture was presented in a blog post in August, 2011.  Since the initial posting, the Connected Vehicle ecosystem has started to take shape and is gaining significant momentum on multiple fronts, including amongst the automotive and telecommunications industries, as well as the Federal Government.  As a result,  I thought it might be of some value to revisit and update the hybrid communications framework originally proposed for the Connected Vehicle.

DSRC-Cell_DiagramCurrent Values

The primary attractiveness of commercial cellular continues to be maturity of technology and network coverage, including for most major urban areas, suburban areas and even significant coverage of rural areas.  Dedicated Short Range Communications (DSRC) is currently limited to approximately 1200 feet, line of sight, and will require significant investment in new infrastructure. Commercial wireless and Wi-Fi technologies continue to show promise for providing secondary, tier-two services associated with the Connected Vehicle.

Current Barriers and Limitations

Substantial limitations still remain. The prevailing barrier is communications latency with regards to minimum requirements associated with V2V and V2I. In addition, commercial cellular networks remain vulnerable to network congestion issues (peak periods), including denial of service and dropped calls.    Also, cost remains a significant hinderance, as the Federal Government has taken the stance that automotive safety should be free to the end-user.

Additional Resources

 The Battle Between Cellular and DSRC – Panel Discussion from Telematics Update

http://uk.telematicsupdate.com/fc_tele_evupdatelz/lz.aspx?p1=05119912S3022&CC=&p=1&cID=0&cValue=1

Hybrid Communications Network for the Connected Vehicle

http://terranautix.com/2011/08/14/hybrid-communications-network-for-the-connected-vehicle/

Connected Vehicle Insights – Fourth Generation Wireless – Vehicle and – Highway Gateways to the Cloud

http://www.its.dot.gov/research/pdf/Tech%20Scan%204G%20Wireless%20(final)%20.pdf