Mobile applications have garnered a lot of attention in the last years. The computational capabilities of mobile devices are the mainstay to develop completely new application types. The provision of augmented reality experiences on mobile devices paves one alley in this field. For example, in the automotive domain, augmented reality applications are used to experience, inter alia, the interior of a car by moving a mobile device around. The device’s camera then detects interior parts and shows additional information to the customer within the camera view. Another application type that is increasingly utilized is related to the combination of serious games with mobile augmented reality functions. Although the latter combination is promising for many scenarios, technically, it is a complex endeavor. In the AREA (Augmented Reality Engine Application) project, a kernel was implemented that enables location-based mobile augmented reality applications. Importantly, this kernel provides a flexible architecture that fosters the development of individual location-based mobile augmented reality applications. The work at hand shows the flexibility of AREA based on a developed serious game. Furthermore, the algorithm framework and major features of it are presented. As the conclusion of this paper, it is shown that mobile augmented reality applications require high development efforts. Therefore, flexible frameworks like AREA are crucial to develop respective applications in a reasonable time.
Avoid common mistakes on your manuscript.
In everyday life, mobile augmented reality is increasingly utilized (Sánchez-Acevedo et al. 2018). However, the development of respective mobile applications is still complex. To mention only one example, existing mobile operating systems pose challenging differences and are frequently updated. Although the capabilities of modern mobile devices enable powerful mobile applications, the aforementioned aspects make the development of mobile augmented reality applications costly and time-consuming (Korinth et al. 2020). Hence, flexible frameworks are required. With the releases of ARkit Apple (2019) and ARCore Google (2020) by Apple and Google, it seems that also large vendors address the demands of mobile application developers by providing flexible frameworks. Interestingly, for other mobile application types, the trend to provide flexible frameworks is also increasing. For example, Apple has recently released a framework called HealthKit Apple (2020b) to develop mobile health applications more flexibly.
In the AREA (Augmented Reality Engine Application) project, a kernel was implemented that enables the robust implementation of location-based mobile augmented reality applications. In the light of the discussed demands, the kernel focuses on the following pillars:
For interested readers, in-depth information to the modular design principles of AREA can be found in Pryss et al. (2016, 2017a). In addition, the works (Schickler et al. 2015; Geiger et al. 2014, 2013; Pryss et al. 2017b) discuss in what way AREA deals with the peculiarities of the different mobile operating systems.
Importantly, this work extends (Schickler 2019). In the latter work, the algorithm framework of AREA was presented in more detail. This manuscript extends (Schickler 2019) by the following aspects:
The remainder of the work at hand is organized as follows: In Sect. 2, related works are discussed. Sect. 3 discusses the developed AREA algorithms, while Sect. 4 presents aspects of comparing AREA with ARKit from Apple. In Sect. 5, a feature is presented that enables the optimization of tracks and areas using AREA. In Sect. 6, the development of a serious game on top of AREA is presented and discussed. In Sect. 7, the practical use of AREA and its features based on the presented results are summarized. Finally, Sect. 8 concludes the paper with a summary and an outlook.
Previous research that is related to the development of location-based augmented reality applications in non-mobile environments can be found in Kooper and MacIntyre (2003). In Kähäri and Murphy (2006), mobile devices are used to develop an augmented reality system. The augmented reality application described in Lee et al. (2009) enables users to interact with media data through augmented reality. However, none of these approaches share insights pertaining to the development of location-based augmented reality on mobile devices as AREA does. Regarding tracks in mobile augmented reality, only little work can be found in literature. For example, the approaches (Vlahakis et al. 2002; Lee and Hollerer 2008; Hollerer 2004) present tracks as a key feature of (mobile) augmented reality applications. Interestingly, the algorithms to implement the tracks are not presented. In addition, no performance issues related to the developed track algorithms are discussed. Moreover, few contributions exist, which deal with the engineering of mobile augmented reality systems in general. For example, (Grubert et al. 2011) validates existing augmented reality browsers in the light of engineering aspects. As two recent examples, in Ren et al. (2019); Liu et al. (2019), edge-driven aspects for mobile augmented reality are presented. These works also show that engineering aspects are crucial for the development of mobile augmented reality applications. Kim et al. (2016), in turn, discusses various types of location-based augmented reality scenarios. More precisely, issues that have to be particularly considered for a specific scenario are discussed in more detail. Again, engineering aspects of mobile applications are not considered. In Yang et al. (2016), an authoring tool for mobile augmented reality applications, which is based on marker detection, is proposed, while (Paucher and Turk 2010) presents an approach for indoor location-based mobile augmented reality applications. Reitmayr and Schmalstieg (2003) gives an overview of various aspects of mobile augmented reality for indoor scenarios. Moreover, another scenario for mobile augmented reality is presented in Lee and Rhee (2015). The authors use mobile augmented reality for image retrieval. To summarize, (Yang et al. 2016; Paucher and Turk 2010; Reitmayr and Schmalstieg 2003; Lee and Rhee 2015) do not address engineering aspects of location-based mobile applications. In Chung et al. (2016), an approach supporting pedestrians with location-based mobile augmented reality is presented, while (Capece et al. 2016) deals with a client and server framework enabling location-based applications. Presently, new scenarios emerge, in which mobile augmented reality is investigated. Recent related works can be found that provide general overviews (Sánchez-Acevedo et al. 2018). Other related works deal with particular scenarios. For example, in Liou et al. (2018); Leighton and Crompton (2017); Tosun (2017), many examples related to education are discussed and evaluated. Further scenarios constitute tourism (Jung et al. 2018; Koh 2019) and crime management (Liao et al. 2018). A field, which is currently heavily pursued, constitutes mobile augmented reality in gaming scenarios (Rauschnabel et al. 2017; Litts and Lewis 2019; Laine and Suk 2019; Jang and Liu 2019). In medicine, mobile augmented reality becomes increasingly important as well (Ghandorh et al. 2017; Mladenovic 2019). Considerations on psychological factors when using applications in the reality–virtuality continuum are also being addressed more and more (Ibili and Billinghurst 2019; Hoppenstedt et al. 2019). In addition to this, the behavior of mobile users should be generally considered in future for any mobile application (Beierle 2019; Majeed 2019). Finally, in patents, engineering aspects are also covered (Hill et al. 2019). In conclusion, the engineering of mobile augmented reality applications is still a topical subject, for which AREA tries to be a contribution for the interested community.
Prior to the algorithm details, a brief introduction into the concept of AREA (Geiger et al. 2013; Pryss et al. 2017a) is given. Most importantly, AREA relates users, while holding their mobile device, to the objects detected in the camera view (i.e., Points of Interest (POIs), tracks, areas, 3D objects). In general, AREA is based on five pillars.
To enable these five technical pillars, the algorithms presented in Fig. 1 have been developed. In a first version of AREA (Pryss et al. 2017a), a Points of Interest (POI) algorithm was developed that showed considerable results (see Fig. 1, POI Algorithm v1). More detailed information to this algorithm can be found in Schickler et al. (2015); Geiger et al. (2014, 2013). However, as the computational capabilities of mobile devices have been continuously increased—and the practical requirements of the real-life projects as well— a new POI algorithm became necessary (see Fig. 1, POI Algorithm v2). All presentations in this work are based on POI Algorithm v2. Note that the technical aspects of POI Algorithm v2 can be found in Pryss et al. (2016, 2017a). Furthermore, in Fig. 1, all references are shown, in which the respective information of the algorithms (i.e., their listings and backgrounds) can be found. The following list summarizes the important aspects as well as new features shown in this work:
As an additional information of AREA, Listing 1 presents major settings used in the algorithms of the iOS version of AREA. Note that the same settings are managed on Android. Radius, minRadius, maxRadius, maxPOIVisible, cameraFieldOfView, compassBetterOnlyPortait, radiusPicker, areaIsModal, and sharedInstance are used by the algorithm that positions POIs correctly. useGoogle, googleAPIKey are used for testing purposes. If POIs cannot (or not being wanted to) be loaded from a remote database, this feature can be used to gather data from Google. Finally, poiClustering, horizontalClusterWidth, and verticalClusterHeight are used to handle POI clusters.
Since Apple recently released ARkit Apple (2019), this section shall enable researchers to directly compare the features and function of the iOS version of AREA with ARkit. In general, ARkit was developed by Apple with the goal to provide a multitude of mobile augmented reality applications. Currently, all features developed in AREA pursue the goal to enable location-based mobile augmented reality applications. To be more precise, the location is based on GPS coordinates and therefore AREA mainly aims at outdoor location-based augmented reality applications. In ARkit, also many other features like face tracking are provided. Consequently, ARkit aims at a broader perspective on mobile augmented reality applications. However, from the technical perspective, it might be of interest to directly compare the functions of the iOS version of AREA with ARKit. In ARkit, the chain of classes \(ARSession>ARFrame>ARCamera\) must be used to enable a location-based mobile augmented reality experience. To be more precise, the class ARSession must be used to handle the sensor fusion, while the classes ARFrame and ARCamera must be used to handle the positioning of POIs. Regarding the ARSession class, compared to AREA, a developer must manually add GPS data to the sensor fusion. With ARkit, compared to AREA, a developer is relieved from directly reading data from the device’s motion sensing hardware. Another interesting comparison is related to the ARCamera class of ARKit. By using the ARCamera class, the correct positioning of POIs can be accomplished. Therefore, the relevant components of the ARCamera class (Apple 2020a) can be compared to the iOS version of AREA as shown in Table 1.
Second, Listing 3 presents the listing that can be compared to the func projectionMatrix of ARKit.
Third, Listing 4 presents the listing that can be compared to the func projectPoint of ARKit.
Currently, performance experiments are conducted to compare ARKit with AREA. In general, the provision of ARKit emphasizes that mobile augmented reality has become an important mobile application type. In line with ARKit, the application of AREA in practice revealed that features enabling mobile augmented reality applications beyond location-based outdoor scenarios are promising. Therefore, the authors of the work at hand work on new features like, for example, the recognition of objects in AREA. Furthermore, AREA is currently compared with ARCore Google (2020) from Google.
In the real-life projects Footnote 1 for which AREA is utilized, the display of many tracks and areas with the algorithm shown in Pryss et al. (2017b) revealed performance issues. Therefore, a feature on top of this algorithm was implemented in order to cope with demanding scenarios. It is only shown how the optimization is implemented in Android and for tracks; however, areas and the implementation on iOS are performed using the same principles. In general, a track is displayed by the use of bars. Between each bar, a distance of 1 m ( \(m~=~meter\) ) is used. Having this in mind, a track of 1km (km=kilometer) requires roughly 501 bars. Each bar, in turn, is represented by two triangles. Each vertex of a triangle has 3 coordinates (x, y, and z), and a RGBA value (i.e., r, g, b, and a components). The three coordinates as well as the 4 RGBA components require 4 bytes. Having these values also in mind, reconsider the track of 1km. This track would require \(501\cdot 2\cdot 3\cdot (3+4)\cdot 4 = 84168~bytes\) to store it. If many tracks or areas shall be displayed, this affects the performance based on the required data to be stored and calculated. To increase the performance in the case that many tracks have to be displayed (or/and areas), the general idea is to manage a detail level for tracks (and areas). Based on this detail level, tracks and areas can be displayed in different resolutions. The notion of the resolution and how it is calculated are shown in the following.
As the first step, the required preliminary calculations are presented. As a first step, consider a list checkpoints containing n vectors (x, y, and z). Each vector n in checkpoints represents one point on a track that shall be displayed. Furthermore, the checkpoints list stores the values in an ordered manner according to a track. In addition, three further lists are managed: degreesY, degreesXZ, and pairs. Each of these lists stores \(n-2\) values. More specifically, the values of the three lists store the following:
To determine degreesY and degreesXZ, the following calculations are applied:
Based on the shown lists, the size of a track can be decreased with respect to so-called detail levels. A detail level reduces tracks (and areas) to x track points. Reduction means that the originally defined amount of track points n is reduced to x. The reduction, in turn, is calculated as follows:
The implementation of the algorithm to reduce the number of track points is shown in Listing 5. Note that the listing only shows the Android version (the iOS version works accordingly).
AREA manages seven detail levels. To be more specific, the tracks (and areas) are displayed using these levels, depending on the distance a user has to them. The detail levels are distinguished by the number of track points to which the track is reduced to by Algorithm 5. The levels, in turn, are managed as shown in Table 2.
As already discussed in Sect. 2, the combination of mobile augmented reality and serious games has recently garnered a lot of attention (Rauschnabel et al. 2017; Litts and Lewis 2019; Laine and Suk 2019; Jang and Liu 2019). In the real-life projects of AREA (CMCityMedia 2020), this was also a frequent demand. Therefore, the development of an implemented serious game on top of AREA is presented. The game is denoted with ARGame. In detail, the technical aspects of the ARGame, the required extensions of AREA, and impressions of the ARGame are presented. With ARGame, it can be shown that the flexible design of AREA enables new applications on top of it in a reasonable development time.
Being a decisive aspect, the operating principle of ARGame is delineated in the following. In this context, note that AREA is commercially utilized for city apps (CMCityMedia 2020). In these apps, users have the opportunity to experience many aspects of a city. With the presented ARGame, the experience shall be enhanced. To be more precise, a citizen shall be enabled when using ARGame to find avatars. These avatars are geo-tagged, and if a user is within a radius of 10m to an avatar, then the latter is displayed in the camera view of a user. Figs. 4 and 5 illustrate how these avatars appear on Android and iOS.
If the user clicks on the avatar, then it disappears and the user gets credits for having found the avatar. Credits, in turn, can be used to be redeemed in the participating stores of the respective city for which the app was developed. The overall idea is that the attention of users is attracted to important points in the city. By using this principle, users learn more about the city, on the other, they are awarded. Fig. 6 illustrates received points of a user. One more feature is provided by the ARGame, which is shown in Fig. 7, it offers contextual information. The latter shall support users in playing ARGame.
Next, it is discussed how ARGame was technically integrated into the existing AREA core. Note that the technical integration is discussed based on the Android implementation as the differences to iOS are only very slightly.
If AREA is started by a user, then the activity denoted with Area20MainActivity is called. The respective onCreate method is shown in Listing 6.
In the context of the onCreate method, the getFragment method is important, which is shown in Listing 7.
Based on the content of the config object, AREA decides which augmented reality function shall be utilized. Notably, Area20MainFragment comprises all algorithms, which have been summarized in Fig. 1. In this case, AREA shows Points of Interest, Tracks, and Areas. As the second option, Area20GameFragment means to start the ARGame. A third option, which is currently implemented, is called Area20MarketingFragment. This feature shall provide a chatbot, which appears in front of geo-tagged shops to help customers before actually entering the shop.
In the Area20MainFragment, the classic AREA functions are summarized. As already described, these are the features to display POIs, tracks, areas, and the radar. Notably, the Area20GameFragment inherits from Area20MainFragment, and mainly uses the transformation of coordinates and the radar feature. Due to the reason that in Area20MainFragment solely 2D objects are used, but 3D objects are additionally needed in Area20GameFragment, the rendering feature had to be changed for Area20GameFragment. This includes adding new functions that are shown in Figs. 4, 5, 6, and 7. Finally, the required calculations of the positions of the avatars had to be changed for Area20GameFragment.
Technically, the changes are accomplished as follows. After initializing the Area20MainFragment, the method denoted with initViews is called, which is shown in Listing 8. In this method, the radar and the button to terminate AREA are initialized. These functions are also used by the ARGame. In this method, it is also evaluated whether or not the app is allowed to use the camera function of the mobile device. Furthermore, the method initSurfaceView is initialized, which is important for the rendering of the avatars. For ARGame, the method initSurfaceView is overwritten for the rendering of avatars. In Listing 9, the method, which was overwritten, is presented. Importantly, the object, which is denoted with sensors, corresponds to the same object as in Area20MainFragment to calculate the correct and current position of the mobile device.
To position avatars within the radar, the function didUpdateLocation of Area20MainFragment is reused, which is shown in Listing 10. With the method setUpSourroundingPointsOfInterest, those avatars are determined, which are within a radius of 400m to the user. These avatars will be then displayed within the radar in the same way as displayed by Area20MainFragment.
Prior to the presentation of the procedure to initialize and start the game, Fig. 8 summarizes all classes of AREA, including those that are needed for ARGame. Most importantly, the modular design of AREA has enabled the flexible integration of ARGame.
Finally, the overall procedure of ARGame is presented. After the AREA20GameFragment is displayed, a GET request is sent to the server of the company of the city apps (CMCityMedia 2020) to eventually obtain the available games. Listing 11 shows such a request. Each ARGame has an ID, with which the available avatars for an ARGame are specified (see Listing 12)
Then, it will be identified whether or not a user is already subscribed to ARGame. If not, then all avatars are loaded. If the user is already subscribed for ARGame, only those avatars will be loaded that were not found by the respective user so far. The loading procedure of the avatars is done by the method addARTarget (see Listing 13).
ARTargets are handled by AREAPointOfInterest in Area20MainFragment. They are sent to the poiStore by the Area20MainFragment in order to finally display the ARTargets on the radar. After that, the ARTargets are handed over to the ARGamingSurfaceView method (see Listing 14), which, in turn, hands them further over to the ARGamingRenderer
After handing over the ARTargets to the ARGamingRenderer, the ARTargets (i.e., the avatars) are rendered and positioned based on their GPS coordinates. Note that the utilized textures are currently hardcoded within the ARGamingRenderer.
Finally, to get a better impression how the loading procedures of the classes Area20MainFragment and AREA20GameFragment differ, the two sequence diagrams in Figs. 9 and 10 show both procedures in detail. Fig. 9 shows it for AREA20GameFragment, while Fig. 10 presents it for Area20MainFragment.
It is further shown in Figs. 9 and 10 that the general extensions of AREA for the ARGame were flexibly possible.
Currently, AREA is used in various scenarios in everyday life (CMCityMedia 2020). Three aspects are particularly important for this extensive use. First, the algorithm framework shown in this work (see Fig. 1) was bundled into the AREA kernel, including its modular architecture (Pryss et al. 2017a, b). Following this, the development of business applications on top of AREA becomes easily possible. This positive characteristic was mainly shown in this work based on the development of ARGame. Second, AREA reveals a proper user experience with respect to robustness and performance. The achieved robustness was also an important pillar for the development of ARGame. Experiments conducted with AREA (Pryss et al. 2017a, b) confirm that it is competitive to mobile applications that provide the same or a similar feature set at the time of conducting the experiments. Third, AREA provides the same feature set on Android and iOS. The ability to cope with the peculiarities of the different mobile operating systems, while providing the same features on all of these mobile operating systems, is highly welcome in practice. However, to keep pace with the frequent updates of the underlying mobile operating systems on one hand, and to continuously implement new features that emerge in practice on the other hand, is still a very challenging endeavor. Therefore, insights into frameworks and operating principles as shown in this work are of utmost importance. Nevertheless, in future experiments, AREA must prove its performance compared to ARKit Apple (2019) and ARCore Google (2020). In general, performance measures are currently missing for two other important aspects. First, it must be quantitatively measured how developers consider AREA compared to other solutions in terms of time and working costs. Second, it must be measured in more experiments, how AREA competes with existing other frameworks as well as other existing mobile applications. In the light of these limitations, the utilization of AREA in commercial scenarios shows its feasibility in practice.
This work provided insights into the development of the AREA framework. As the first contribution, a comprehensive overview of the implemented algorithms to enable location-based mobile augmented reality applications was presented. On top of this, the development of ARGame was presented, a serious game, which is used for commercial purposes. In general, the development of mobile applications is demanding when considering the peculiarities of the different mobile operating systems on the market. To cope with this heterogeneity, AREA provides the same functionality for business applications that are developed based on top of it for Android and iOS. This is enabled by enclosing all features in the AREA kernel. To show how this is technically accomplished, all steps to integrate ARGame into AREA were presented. Following such implementations, application developers can utilize AREA like ARKit from Apple or ARCore from Google to easily create their own location-based mobile augmented reality applications. Furthermore, as this work provides implementation details, this may help the community in using the insights for further developments and improvements. In this context, it was also shown that frameworks like AREA should be quantitatively assessed. Hence, AREA has been evaluated in experiments (Pryss et al. 2017a, b). These experiments have shown that AREA reveals considerable performance results compared to other mobile augmented reality applications providing a similar feature set. Furthermore, it was reported that currently conducted experiments investigate how AREA competes with ARKit and ARCore. Another experiment investigates a new feature that was developed on top of the track and area algorithm. Altogether, mobile augmented reality applications can support many new scenarios and fields. However, powerful solutions that can be easily developed across the different mobile operating systems in the same way are becoming more and more important. Finally, considerations on psychological factors should be also taken into account when using applications in the virtuality-reality continuum like AREA. Therefore, such investigations are pursued in future work on AREA.
Open Access funding provided by Projekt DEAL.