Edited, memorised or added to reading queue

on 28-Oct-2024 (Mon)

Do you want BuboFlash to help you learning these things? Click here to log in or create user.

#data-science #infrastructure
A typical bottleneck is caused by the fact that humans can’t deliver software (or hardware, if operating outside the cloud) fast enough. Even if they were capable of hacking code fast enough, they may be busy maintaining existing systems, which is another critically human activity. This observation helps us to realize that although “infrastructure” sounds very technical, we are not building infrastructure for the machines. We are building infrastructure to make humans more productive. This realization has fundamental ramifications to how we should think about and design infrastructure for data scientists— for fellow human beings, instead of for machines. For instance, if we assume that human-time is more expensive than computer-time, which is certainly true for most data scientists, it makes sense to use a highly expressive, productivity-boosting language like Python instead of a low-level language like C++, even if it makes workloads more inefficient to process. We will dig deeper into this question in chapter 5
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




Flashcard 7663105740044

Tags
#RNN #ariadne #behaviour #consumer #deep-learning #priority #recurrent-neural-networks #retail #simulation #synthetic-data
Question
Recognizing relevant patterns in long input streams (> 100 actions) can turn out to be difficult for the human mind. To achieve better [...], in many e-commerce applications consumer behavior can be viewed on the level of sessions
Answer
explainability

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
Recognizing relevant patterns in long input streams (> 100 actions) can turn out to be difficult for the human mind. To achieve better explainability, in many e-commerce applications consumer behavior can be viewed on the level of sessions

Original toplevel document (pdf)

cannot see any pdfs







#Linux

If you have a list of files, but you only want to delete files older the a certain date, for example, a maildir folder with 5 years worth of email, and you want to delete everything older then 2 years, then run the following command.

find . -type f -mtime +XXX -maxdepth 1 -exec rm {} \;

The syntax of this is as follows.

  1. find – the command that finds the file
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

How to delete all files before a certain date in Linux
How to delete all files before a certain date in Linux Posted on: September 15, 2015 If you have a list of files, but you only want to delete files older the a certain date, for example, a maildir folder with 5 years worth of email, and you want to delete everything older then 2 years, then run the following command. find . -type f -mtime +XXX -maxdepth 1 -exec rm {} \; The syntax of this is as follows. find – the command that finds the files . – the dot signifies the current folder. You can change this to something like /home/someuser/mail/somedomain/someemail/cur or whatever path you need -type f – this means only files. D




#Linux

How to delete all files before a certain date in Linux

If you have a list of files, but you only want to delete files older the a certain date, for example, a maildir folder with 5 years worth of email, and you want to delete everything older then 2 years, then run the following command.

find . -type f -mtime +XXX -maxdepth 1 -exec rm {} \;

The syntax of this is as follows.

. (dot) – signifies the current folder. You can change this to something like /home/someuser/mail/somedomain/someemail/cur or whatever path you nee

statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

How to delete all files before a certain date in Linux
How to delete all files before a certain date in Linux Posted on: September 15, 2015 If you have a list of files, but you only want to delete files older the a certain date, for example, a maildir folder with 5 years worth of email, and you want to delete everything older then 2 years, then run the following command. find . -type f -mtime +XXX -maxdepth 1 -exec rm {} \; The syntax of this is as follows. find – the command that finds the files . – the dot signifies the current folder. You can change this to something like /home/someuser/mail/somedomain/someemail/cur or whatever path you need -type f – this means only files. Do not look at or delete folders -mtime +XXX – replace XXX with the number of days you want to go back. for example, if you put -mtime +5, it will delet




#deep-learning #keras #lstm #python #sequence
Product Recommendation . Given a sequence of past purchases for a customer, predict the next purchase for a customer
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
s about the weather over time, predict the expected weather tomorrow. Stock Market Prediction . Given a sequence of movements of a security over time, predict the next movement of the security. <span>Product Recommendation . Given a sequence of past purchases for a customer, predict the next purchase for a customer <span>

Original toplevel document (pdf)

cannot see any pdfs




#deep-learning #keras #lstm #python #sequence

The choice of activation function is most important for the output layer as it will define the format that predictions will take.

Binary Classification (2 class) : Logistic activation function, or sigmoid , and one neuron the output layer.

statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
layer: Regression: Linear activation function, or linear , and the number of neurons matching the number of outputs. This is the default activation function used for neurons in the Dense layer. <span>Binary Classification (2 class) : Logistic activation function, or sigmoid , and one neuron the output layer. Multiclass Classification (> 2 class) : Softmax activation function, or softmax , and one output neuron per class value, assuming a one hot encoded output pattern. <span>

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7663121730828

Question

ggradar: radar plots with [...] in R

Answer
ggplot

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Open it
ggradar: radar plots with ggplot in R







Flashcard 7663123565836

Tags
#data-science #infrastructure
Question
to conduct data science projects, a common infrastructure can help to increase the number of projects that can be executed simultaneously (volume), speed up the time to market (velocity), ensure that the results are robust ([...]), and make it possible to support a larger variety of projects
Answer
validity

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
ts, a common infrastructure can help to increase the number of projects that can be executed simultaneously (volume), speed up the time to market (velocity), ensure that the results are robust (<span>validity), and make it possible to support a larger variety of projects <span>

Original toplevel document (pdf)

cannot see any pdfs







#recurrent-neural-networks #rnn
we can conceal certain parts of the input sequence and train the model to predict what is missing, to ‘‘fill in the blanks”. If we always blank only the last element in a historical sequence, the model effectively learns to predict the most likely future, conditioned on the observed past. Applying this idea to customer transaction records, we can forecast sequences predicting future behavior. We next present our model architecture in detail
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
om the fact that these models can translate a sequence of input elements into a sequence of outputs. Different seq2seq models can be created depending on how we manipulate the input data; i.e., <span>we can conceal certain parts of the input sequence and train the model to predict what is missing, to ‘‘fill in the blanks”. If we always blank only the last element in a historical sequence, the model effectively learns to predict the most likely future, conditioned on the observed past. Applying this idea to customer transaction records, we can forecast sequences predicting future behavior. We next present our model architecture in detail <span>

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7663126973708

Tags
#feature-engineering #lstm #recurrent-neural-networks #rnn
Question
[...] == Hidden Markov Model
Answer
HMM

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
HMM == Hidden Markov Model

Original toplevel document (pdf)

cannot see any pdfs