[arXiv] BigDataFr recommends: Statistical Challenges of Big Brain Network Data

BigDataFr recommends: Statistical Challenges of Big Brain Network Data […] Subjects: Neurons and Cognition (q-bio.NC); Methodology (stat.ME) We explore the main characteristics of big brain network data that offer unique statistical challenges. The brain networks are biologically expected to be both sparse and hierarchical. Such unique characterizations put specific topological constraints onto statistical approaches and […]

[arXiv] BigDataFr recommends: Improving Viability of Electric Taxis by Taxi Service Strategy Optimization

BigDataFr recommends: Improving Viability of Electric Taxis by Taxi Service Strategy Optimization: A Big Data Analysis of New York City […] Subjects: Computers and Society (cs.CY) Electrification of transportation is critical for a low-carbon society. In particular, public vehicles (e.g., taxis) provide a crucial opportunity for electrification. Despite the benefits of eco-friendliness and energy efficiency, […]

[Datasciencecentral] BigDataFr recommends: Representation of Numbers with Incredibly Fast Converging Fractions

BigDataFr recommends: Representation of Numbers with Incredibly Fast Converging Fractions […] Here we discuss a new system to represent numbers, for instance constants such as Pi, e, or log 2, using rational fractions. Each iteration doubles the precision (the number of correct decimals computed) making it converging much faster than current systems such as continued […]

[arXiv] BigDataFr recommends: Amplifying Inter-message Distance: On Information Divergence Measures in Big Data

BigDataFr recommends: Amplifying Inter-message Distance: On Information Divergence Measures in Big Data […] Subjects: Information Theory (cs.IT) Message identification (M-I) divergence is an important measure of the information distance between probability distributions, similar to Kullback-Leibler (K-L) and Renyi divergence. In fact, M-I divergence with a variable parameter can make an effect on characterization of distinction […]

[arXiv] BigDataFr recommends: Visualization of Big Spatial Data using Coresets for Kernel Density Estimates

BigDataFr recommends: Visualization of Big Spatial Data using Coresets for Kernel Density Estimates […] Subjects: Human-Computer Interaction (cs.HC); Computational Geometry (cs.CG) The size of large, geo-located datasets has reached scales where visualization of all data points is inefficient. Random sampling is a method to reduce the size of a dataset, yet it can introduce unwanted […]

[arXiv] BigDataFr recommends: A European research roadmap for optimizing societal impact of big data on environment and energy efficiency

BigDataFr recommends: A European research roadmap for optimizing societal impact of big data on environment and energy efficiency […] We present a roadmap to guide European research efforts towards a socially responsible big data economy that maximizes the positive impact of big data in environment and energy efficiency. The goal of the roadmap is to […]

[arXiv] BigDataFr recommends: Massively-Parallel Feature Selection for Big Data

BigDataFr recommends: Massively-Parallel Feature Selection for Big Data […] We present the Parallel, Forward-Backward with Pruning (PFBP) algorithm for feature selection (FS) in Big Data settings (high dimensionality and/or sample size). To tackle the challenges of Big Data FS PFBP partitions the data matrix both in terms of rows (samples, training examples) as well as […]

[arXiv] BigDataFr recommends: Strategies for Big Data Analytics through Lambda Architectures in Volatile Environments

BigDataFr recommends: Strategies for Big Data Analytics through Lambda Architectures in Volatile Environments […] Expectations regarding the future growth of Internet of Things (IoT)-related technologies are high. These expectations require the realization of a sustainable general purpose application framework that is capable to handle these kinds of environments with their complexity in terms of heterogeneity […]