Edited, memorised or added to reading queue

on 27-Jun-2024 (Thu)

Do you want BuboFlash to help you learning these things? Click here to log in or create user.

#feature-engineering #lstm #recurrent-neural-networks #rnn
Feature engineering has been used broadly to refer to multiple aspects of feature creation, extraction, and transformation
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
In machine learning, a feature refers to a variable that describes some aspect of individual data objects (Dong & Liu, 2018). Feature engineering has been used broadly to refer to multiple aspects of feature creation, extraction, and transformation. Essentially, it refers to the process of using domain knowledge to create useful features that can be fed as predictors into a model.

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7642445647116

Tags
#causality #statistics
Question
A graph is a collection of nodes (also called “vertices”) and [...] that connect the nodes.
Answer
edges

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
A graph is a collection of nodes (also called “vertices”) and edges that connect the nodes.

Original toplevel document (pdf)

cannot see any pdfs







#feature-engineering #lstm #recurrent-neural-networks #rnn
Similar to our studies, we could encode brand choices and the decision environment as we encoded solicitations and donations: as a multidimensional vector.
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
choices can be modeled as sequential choices, and data complexity increases exponentially with the number of brands (with interaction effects), LSTM neural networks offer suitable alternatives. <span>Similar to our studies, we could encode brand choices and the decision environment as we encoded solicitations and donations: as a multidimensional vector. We conjecture that testing the performance of LSTM neural networks in the context of brand choices would constitute an exciting replication area. <span>

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7642449579276

Tags
#RNN #ariadne #behaviour #consumer #deep-learning #priority #recurrent-neural-networks #retail #simulation #synthetic-data
Question
In this paper, we show that recurrent neural networks (RNNs) are promising to overcome both [...] of vector-based methods, tedious feature engineering and lack of explainability.
Answer
shortcomings

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
In this paper, we show that recurrent neural networks (RNNs) are promising to overcome both shortcomings of vector-based methods, tedious feature engineering and lack of explainability.

Original toplevel document (pdf)

cannot see any pdfs







#RNN #ariadne #behaviour #consumer #deep-learning #patterns #priority #recurrent-neural-networks #retail #simulation #synthetic-data
The RNNs are made of high dimensional hidden states with non-linear dynamics.
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
recurrent connections is called recurrent neural networks (RNNs), which are capable of modelling sequential data for sequence recognition, sequence production, and time series prediction [15]. <span>The RNNs are made of high dimensional hidden states with non-linear dynamics. <span>

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7642454560012

Tags
#RNN #ariadne #behaviour #consumer #deep-learning #priority #recurrent-neural-networks #retail #simulation #synthetic-data
Question
Consumer histories are inherently sequential and of varying [...] T , making RNNs a natural model choice.
Answer
lengths

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
Consumer histories are inherently sequential and of varying lengths T , making RNNs a natural model choice.

Original toplevel document (pdf)

cannot see any pdfs







#deep-learning #keras #lstm #python #sequence
Normalization requires that you know or are able to accurately estimate the minimum and maximum observable values.
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
3.1. Prepare Numeric Data 3.1.1 Normalize Series Data Normalization is a rescaling of the data from the original range so that all values are within the range of 0 and 1. Normalization requires that you know or are able to accurately estimate the minimum and maximum observable values. You may be able to estimate these values from your available data. If your series is trending up or down, estimating these expected values may be difficult and normalization may not be

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7642459016460

Tags
#RNN #ariadne #behaviour #consumer #deep-learning #priority #recurrent-neural-networks #retail #simulation #synthetic-data
Question
Consumer behavior is inherently sequential which makes [...]s a perfect fit.
Answer
RNN

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
Consumer behavior is inherently sequential which makes RNNs a perfect fit.

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7642460327180

Tags
#feature-engineering #lstm #recurrent-neural-networks #rnn
Question
For this exercise, the authors developed two separate LSTM models. The first one predicted the likelihood that each donor was going to respond favorably to the solicitation (0/1), and we calibrated it on the entire calibration data (N = 61,928). The second LSTM model predicted the [...] t in case of donation, and we calibrated it on the individuals who donated in the calibration data (N = 6,456)
Answer
donation amount

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
ed the likelihood that each donor was going to respond favorably to the solicitation (0/1), and we calibrated it on the entire calibration data (N = 61,928). The second LSTM model predicted the <span>donation amount t in case of donation, and we calibrated it on the individuals who donated in the calibration data (N = 6,456) <span>

Original toplevel document (pdf)

cannot see any pdfs







#deep-learning #keras #lstm #python #sequence
as part of framing your problem you must split long sequences into subsequences that are both long enough to capture relevant context for making predictions, but short enough to efficiently train the network
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
se. This is a sensible approach that can work well in practice, but it is blind to temporal dependencies that span more than 20 time steps. — Training Recurrent Neural Networks, 2013 This means <span>as part of framing your problem you must split long sequences into subsequences that are both long enough to capture relevant context for making predictions, but short enough to efficiently train the network <span>

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7642464521484

Tags
#deep-learning #keras #lstm #python #sequence
Question
LSTM [...] are comprised of weights and gates
Answer
cells

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
LSTM cells are comprised of weights and gates

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7642466618636

Tags
#causality #statistics
Question
In [...] data, it is unrealistic to assume that the treatment groups are exchangeable. In other words, there is no reason to expect that the groups are the same in all relevant variables other than the treatment.
Answer
observational

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
In observational data, it is unrealistic to assume that the treatment groups are exchangeable. In other words, there is no reason to expect that the groups are the same in all relevant variables other t

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7642468191500

Tags
#causality #statistics
Question
Regular Bayesian networks are purely [...] models, so we can only talk about the flow of association in Bayesian networks.
Answer
statistical

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
Regular Bayesian networks are purely statistical models, so we can only talk about the flow of association in Bayesian networks.

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7642473172236

Tags
#abm #agent-based #machine-learning #model #priority #synergistic-integration
Question
Traditionally, two modeling methodologies, known as top-down and [...], modeling approaches have been widely applied to model complex systems
Answer
bottom-up

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
Traditionally, two modeling methodologies, known as top-down and bottom-up, modeling approaches have been widely applied to model complex systems

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7642474745100

Tags
#causality #statistics
Question
We will denote [...] of 𝑋 by de(𝑋)
Answer
descendants

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
We will denote descendants of 𝑋 by de(𝑋)

Original toplevel document (pdf)

cannot see any pdfs







#Data #GAN #reading #synthetic
In order for synthetic data — completely artificial and private production data — to be useful, it must maintain these relationships (referential integrity) among the data, including preserving the data links that can stretch across multiple tables.Referential integrity
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
Referential integrity is a structural element that ensures that the relationships among different sets of data are accurate. In order for synthetic data — completely artificial and private production data — to be useful, it must maintain these relationships among the data, including preserving the data links that can stretch across multiple tables.

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7642481036556

Tags
#Data #GAN #reading #synthetic
Question
In generating synthesised data, normally we use the finest [...]. For instance, order_id would represent a store managing orders, or person_id could represent a population.
Answer
granularity

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
In generating synthesised data, normally we use the finest granularity. For instance, order_id would represent a store managing orders, or person_id could represent a population.

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7642482347276

Tags
#recurrent-neural-networks #rnn
Question
We propose and implement a flexible methodological framework that provides marketing managers with highly accurate forecasts of fine granularity both in the short and in the long run. Our method also captures [...] and customer-level dynamics and allows to differentiate between different customer groups
Answer
seasonal peaks

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
implement a flexible methodological framework that provides marketing managers with highly accurate forecasts of fine granularity both in the short and in the long run. Our method also captures <span>seasonal peaks and customer-level dynamics and allows to differentiate between different customer groups <span>

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7642484968716

Tags
#feature-engineering #lstm #recurrent-neural-networks #rnn
Question
Feature engineering has been used broadly to refer to multiple aspects of feature [...], extraction, and transformation
Answer
creation

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
Feature engineering has been used broadly to refer to multiple aspects of feature creation, extraction, and transformation

Original toplevel document (pdf)

cannot see any pdfs







#RNN #ariadne #behaviour #consumer #deep-learning #priority #recurrent-neural-networks #retail #simulation #synthetic-data
In principle, one could evaluate the logistic regression model at every single time-step in the consumer history to determine the influence of individual events
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
In principle, one could evaluate the logistic regression model at every single time-step in the consumer history to determine the influence of individual events. However, this would involve the inefficient process of re-calculating features for every time-step. Calculations at timesteps t and t − 1 would be highly redundant: features at t repre

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7642488114444

Question

To create an environment with a specific version of Python and multiple packages including a package with a specific version:

$ conda create -n <env_name> [...]=<version#> <packagename> <packagename> <packagename>=<version#>
Answer
python

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
To create an environment with a specific version of Python and multiple packages including a package with a specific version: $ conda create -n <env_name> python=<version#> <packagename> <packagename> <packagename>=<version#>

Original toplevel document

How to Manage Python Dependencies with Conda - ActiveState
rmine the Current Environment with Conda The current or active environment is shown in parentheses () or brackets [] at the beginning of the Anaconda Prompt or terminal: (<current_env>) $ <span>Recommendations for Avoiding Dependency Conflicts with Conda There are two simple rules to follow: Always create a new environment for each project Install all the packages that you need in the new environment at the same time. Installing packages one at a time can lead to dependency conflicts. To create an environment with a specific version of Python and multiple packages including a package with a specific version: $ conda create -n <env_name> python=<version#> <packagename> <packagename> <packagename>=<version#> Alternatively, you can use conda to install all the packages in a requirements.txt file. You can save a requirements.txt file from an existing environment, or manually create a new requirements.txt for a different environment. To create a conda requirements.txt file from an existing environment: Activate your project environment. See section above entitled “How to Activate an Environment with Conda” for detai







Flashcard 7642489425164

Tags
#deep-learning #keras #lstm #python #sequence
Question

We can summarize the 3 key benefits of LSTMs as:

2. Possesses memory to overcome the issues of [...] temporal dependency with input sequences

Answer
long-term

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
We can summarize the 3 key benefits of LSTMs as: 2. Possesses memory to overcome the issues of long-term temporal dependency with input sequences

Original toplevel document (pdf)

cannot see any pdfs