I am working on a multi-class classification problem for sports prediction, and I want to compare model performance between normalized and non-normalized data. The two datasets are X_train and X_train_normalized.
I would like to know whether my implementation below for comparing them is correct, or whether it could be improved in any way.
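For context, here is a minimal sketch of how X_train_normalized and X_test_normalized could be produced; I am assuming scikit-learn's StandardScaler here, fit on the training split only so no test-set statistics leak into training:

from sklearn.preprocessing import StandardScaler

# Fit the scaler on the training split only, then reuse the fitted
# scaler on the test split to avoid leaking test statistics
scaler = StandardScaler()
X_train_normalized = scaler.fit_transform(X_train)
X_test_normalized = scaler.transform(X_test)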
import tensorflow as tf
import matplotlib.pyplot as plt
from tensorflow.keras.callbacks import EarlyStopping

# One-hot encode y_train and y_test for the 3-class softmax output
y_train = tf.keras.utils.to_categorical(y_train, num_classes=3)
y_test = tf.keras.utils.to_categorical(y_test, num_classes=3)
# Model for the raw (non-normalized) data: two hidden layers with dropout
model = tf.keras.models.Sequential()
model.add(tf.keras.layers.Dense(100, input_dim=127, activation='relu'))  # 127 input features
model.add(tf.keras.layers.Dropout(0.3))
model.add(tf.keras.layers.Dense(64, activation='relu'))
model.add(tf.keras.layers.Dropout(0.3))
model.add(tf.keras.layers.Dense(3, activation='softmax'))
model.summary()
# Model for normalized data
model_normalized = tf.keras.models.Sequential()
model_normalized.add(tf.keras.layers.Dense(100, input_dim=127, activation='relu'))
model_normalized.add(tf.keras.layers.Dropout(0.3))
model_normalized.add(tf.keras.layers.Dense(64, activation='relu'))
model_normalized.add(tf.keras.layers.Dropout(0.3))
model_normalized.add(tf.keras.layers.Dense(3, activation='softmax'))
model_normalized.summary()
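Since the two architectures are identical, the duplication could also be avoided with a small builder function (a sketch; build_model is just an illustrative helper name):

def build_model(input_dim=127, num_classes=3):
    # Same architecture as above: two ReLU hidden layers with dropout,
    # softmax output for the 3 classes
    m = tf.keras.models.Sequential()
    m.add(tf.keras.layers.Dense(100, input_dim=input_dim, activation='relu'))
    m.add(tf.keras.layers.Dropout(0.3))
    m.add(tf.keras.layers.Dense(64, activation='relu'))
    m.add(tf.keras.layers.Dropout(0.3))
    m.add(tf.keras.layers.Dense(num_classes, activation='softmax'))
    return m

model = build_model()
model_normalized = build_model()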
# Compile both models with identical settings so the comparison is fair
model.compile(loss='categorical_crossentropy', metrics=['accuracy', 'Precision', 'Recall'], optimizer='adam')
model_normalized.compile(loss='categorical_crossentropy', metrics=['accuracy', 'Precision', 'Recall'], optimizer='adam')
# Define one EarlyStopping callback per model so their states stay independent across fits
early_stopping = EarlyStopping(monitor='val_loss', patience=3, restore_best_weights=True)
early_stopping_normalized = EarlyStopping(monitor='val_loss', patience=3, restore_best_weights=True)
history = model.fit(X_train, y_train, epochs=100, batch_size=64, validation_split=0.1, callbacks=[early_stopping])
history_normalized = model_normalized.fit(X_train_normalized, y_train, epochs=100, batch_size=64, validation_split=0.1, callbacks=[early_stopping_normalized])
# Plot accuracy history for the raw data
plt.plot(history.history['accuracy'])
plt.plot(history.history['val_accuracy'])
plt.title('model accuracy (raw data)')
plt.ylabel('accuracy')
plt.xlabel('epoch')
plt.legend(['train', 'val'], loc='upper left')
plt.show()
# Plot loss history for the raw data
plt.plot(history.history['loss'])
plt.plot(history.history['val_loss'])
plt.title('model loss (raw data)')
plt.ylabel('loss')
plt.xlabel('epoch')
plt.legend(['train', 'val'], loc='upper left')
plt.show()
# Plot accuracy history for the normalized data
plt.plot(history_normalized.history['accuracy'])
plt.plot(history_normalized.history['val_accuracy'])
plt.title('model accuracy (normalized data)')
plt.ylabel('accuracy')
plt.xlabel('epoch')
plt.legend(['train', 'val'], loc='upper left')
plt.show()
# Plot loss history for the normalized data
plt.plot(history_normalized.history['loss'])
plt.plot(history_normalized.history['val_loss'])
plt.title('model loss (normalized data)')
plt.ylabel('loss')
plt.xlabel('epoch')
plt.legend(['train', 'val'], loc='upper left')
plt.show()
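To compare the two runs directly, the validation curves can also be overlaid on shared axes (a sketch using the two history objects above; the lines may have different lengths because early stopping can end the runs at different epochs):

# Overlay validation accuracy and loss for a direct comparison
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(12, 4))
ax1.plot(history.history['val_accuracy'], label='raw')
ax1.plot(history_normalized.history['val_accuracy'], label='normalized')
ax1.set_title('validation accuracy')
ax1.set_xlabel('epoch')
ax1.legend()
ax2.plot(history.history['val_loss'], label='raw')
ax2.plot(history_normalized.history['val_loss'], label='normalized')
ax2.set_title('validation loss')
ax2.set_xlabel('epoch')
ax2.legend()
plt.show()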
After evaluating both models using:
# Evaluate the model on the test data
score = model.evaluate(X_test, y_test, verbose=2)
score_normalized = model_normalized.evaluate(X_test_normalized, y_test, verbose=2)
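Since evaluate() returns a plain list of numbers, labelling the values with metrics_names makes the two models easier to compare (a sketch):

# Print each metric side by side for the two models
for name, raw, norm in zip(model.metrics_names, score, score_normalized):
    print(f'{name}: raw={raw:.4f}  normalized={norm:.4f}')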
The non-normalized model gets 100% test accuracy, and the normalized model gets 82%.
As you can see in the implementation above, I have applied regularization techniques such as dropout and made sure there are no duplicate target features in my dataset. However, is it normal to get 100% accuracy on the test set as well?
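For reference, here is a sketch of the kind of train/test overlap check I mean, assuming both splits are NumPy arrays (an exact byte-level match only catches verbatim duplicate rows):

import numpy as np

# Count test rows that also appear verbatim in the training data;
# any overlap between the splits could explain a perfect test score
train_rows = {row.tobytes() for row in np.asarray(X_train)}
overlap = sum(row.tobytes() in train_rows for row in np.asarray(X_test))
print(f'{overlap} of {len(X_test)} test rows also appear in X_train')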