The question you have to ask yourself is "Given I know the x-value, what knowledge do I gain about the y-value". And a small side note: you may only consider linear relationships.
Consider the first row. In the first image, if you know the x-value, then the y-value is
y = x, i.e. you know the y-value exactly. In the following two images, if the x-value is high, then the y-value is high too, but there is a random component too. Thus, the correlation coefficient is positive (i.e. knowing x helps in estimating y), but not 1 (i.e. there is no exact equation
y = a*x. In the middle image, there is no relationship between the x- and y-coordinate - they are purely random, so no correlation. In the right three images, it is the same story as in the left three images, but the sign is flipped.
Now consider the second row: in the first three cases, given you know the x-value, you can always infer the y-value, e.g.
y = x,
y = 0.5 x, and
y = 0.1 x. There is no random component in y, so if you know x, you also know the exact value of y, thus the correlation coefficient is 1. It doesn't matter if the formula is
y = 1 x or
y = 0.1 x (i.e. the slope of the line doesn't matter!) all that matters for the correlation coefficient is that there is such a linear coefficient which leads to an exact match.
In the third row, knowing x does give some knowledge on y. For example in the middle plot, you have a relationship of
y = x^2 + random, so there is indeed some relationship between x and y, but it is non-linear. Thus the correlation coefficient is zero. There is no way for you to say "a high x-value also leads to a high y-value" or "a high x-value leads to a low y-value".
I assume this plot is generated e.g. with Seaborn's
pairplot() function, though of course different functions exist for creating this kind of plot. It is important to note that the plots on the diagonal are fundamentally different plots than the other elements.
The off-diagonal plots are two-dimensional scatter plots, i.e. you draw a point for each data sample.
The diagonal plots are histograms of each feature, and not scatter plots. This is because a scatter plot of twice the same variable would always give you a straight line
y = x.
The histograms are actually quite useful as you can get a feeling for the data and make a guess about the underlying distribution.
For example, your
total_rooms variable is heavily skewed, most houses have very few rooms, but there are a few outliers with huge numbers of rooms. I thus wouldn't trust the mean value of
total_rooms much and rather use the median - of course this is highly dependent on what kind of analyse you are doing.